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This  work  introduces  a  new  class  of  nonlinear  response  maps,  called 
polynomial  maps.  The  natural  internal  (state-space)  realizations  of 
polynomial  response  maps  are  studied  and  a  result  is  obtained  on 
existence  and  uniqueness  of  canonical  realizations.  Various  external 
finiteness  conditions  are  related  to  corresponding  ones  of  internal 
realizations.  In  particular,  an  investigation  is  made  into  the 
existence  of  input /output  equations.  The  tools  used  are  mainly 
algebraic-geometric.  The  subclass  of  bounded  polynomial  response 
maps  is  also  introduced;  in  this  case  many  realization  results  can  be 
derived  from  automata  and  language  theory. 


CHAPTER  I.  OUTLINE  OF  MAIN  CONTRIBUTIONS 

In  the  present  work  we  study  the  problem  of  realization  of  polynomial 
input /output  maps.  In  this  introduction  we  restrict  ourselves  to  shift- 
invariant  ,  scalar  input /output  maps  defined  over  infinite  fields,  in 
order  to  present  the  definitions  and  results  in  a  simple  way.  The 
development  in  the  main  text  proceeds  in  greater  generality. 

We  choose  an  infinite  field  k,  which  will  be  fixed  throughout  this 
section. 

let  S  denote  the  set  of  all  sequences  of  elements  of  k  indexed 

by  the  integers  and  with  support  bounded  on  the  left.  In  other  words. 

u(  • )  in  S  is  a  function  u(-)*  Z  -> k  for  which  there  exists  an 

integer  t   such  that  u(t)  =  0  for  all  t  <  t  .  A  (scalar)  input/ 
D     o  o  L- 

output  map  is  then  a  map  f :  S  ->  S:  u(  • )  >->  y(  • );  f  is  (strictly)  causal 

when,  for  all  t  in  Z,  the  output  y(t)  depends  only  on  values  u(j) 

of  inputs  for  j  <  t;  f  is  shift-invariant  if  u(  • )  .->  y(  • )  implies 

(cru)(')  -»  (oy)(-)>  where  u  is  the  shift  operator  defined  by 
(cm)(t)  :=  u(t  -  1). 

These  concepts  are  standard.  One  of  the  contributions  of  the 
present  work  is  the  introduction  of  a  new  notion,  that  of  a  "polynomial" 
input /output  map.  Informally,  this  means  that  y(t)  is  a  polynomial 
function  of  the  past  input  values  u(j),  j  <  t. 

In  order  to  rigorously  define  polynomial  input /output  maps  we  need 
the  concept  of  a  Volterra  series  \|f:  this  is  a  formal  power  series  in 
denumerably  many  variables  |.,  |g,  |_,  ...   such  that  i|f  is  of  finite 
degree  in  each  variable  separately.  A  causal,  shift-invariant  map  is 
uniquely  determined  by  specifying  the  dependence  of  y(0)  upon  values 
of  the  input  u(t)  for  t  <  0.  So  we  can  now  say,  more  precisely,  that 
a  causal,  shift- invariant  input/output  map  f  is  polynomial  iff  there 
exists  a  Volterra  series  \|r   such  that  the  output  y(0)  due  to  an 
input  sequence  u( • )  is  obtained  by  substituting  u(-  j)  for  |   into 
\|r   and  evaluating  the  expression  thus  obtained.  This  evaluation  is 
well-defined  because  there  are  only  finitely  many  nonzero  u(t)   (by 
definition  of  Sj,  and  because  tf  is  a  polynomial  in  each  finite 
subset  of  variables. 


The  present  formalism  is  able  to  represent  a  wide  variety  of  behaviors. 

For  example,  consider  f  ,   defined  by  tyf     =  a  J  +  a  \     +  ...  . 
Then  an  input  u( " )  produces  an  output 


X0a.u(-  J). 

Thus  f   corresponds  to  a  linear  system  with  impulse  response  sequence 

\,    *-2>    •••    ' 

Another  example  is  t-p„  =  2  £-?  |-i  ...6-1  ,  the  sum  running  over  all 
x2      *-*T_  "2     r 
j  <  j  <  ...  <  j   and  all  r  >  0.  Then  fp  corresponds  to  adding  all 

possible  products  of  past  inputs. 

The  main  problem  we  are  interested  in  is  the  following.  What  are 
natural  internal  (i.e.,  state-space)  representations  for  polynomial 
input /output  maps?  We  are  of  course  interested  in  representations  with 
a  certain  amount  of  algebraic,  geometric,  and/or  topological  structure; 
otherwise  the  above  question  could  be  trivially  answered  via  the  "Nerode 
realization"  method  of  automata  theory.  Further,  we  want  to  use  our 
results  to  infer  possible  internal  properties  of  a  given  "black  box";  so 
the  choice  of  structure  should  be  directly  related  to  properties  of 
polynomial  input /output  maps. 

Polynomial  systems  constitute  a  class  of  systems  whose  defining 
maps  are  always  polynomial.  A  polynomial  system  Z     (provisional 
definition)  has 

(a)  X  =  k   as  its  state-space  (n  =  integer); 

(b)  state-transitions  given  by  simultaneous  first  order 
difference  equations 

x(t  +  1)  =  P(x(t),  u(t)), 


where  x(t)  =  (x  (t),  ...,  x  (t))  and  P  =  (P^  ...,  PR)  is  a  polynomial 

.  n+m   ,  n 

k    ->  k  ; 

(c)  an  output  map 


,  n+m   ,  n 
function  k    ->  k  ; 


y(t)  =  h(x(t)), 

where  h  is  a  polynomial  in  n  variables;  and 

(d)  an  initial  state  x^  which  is  an  equilibrium  state  for  the 
zero  input: 

p(x*,  0)  =  xft. 

(The  constraint  on  x   to  be  an  equilibrium  state  is  dictated  by  our 
restriction  to  shift -invariant  input /output  maps;  the  specific  choice  of 
0  as  "equilibrium  input"  is  just  a  matter  of  choice  of  coordinates  in 
the  input  space.)  Let  us  denote  by  P  also  the  recursive  extension  of 
P  to  sequences  of  inputs,  i.e. 

P(x,  vr  ...,  vn+1)  :=P(P(X|  V  ...,  vn),  vn+1). 

Then  £  defines  an  input /output  map  f„:  u( • )  -> y( • )  by  the  rule 
y(t)  :=  h(P(x#,  u(t0),  u(tQ  +  1),  ...,  u(t  -  1)), 

where  t  <  t  is  any  integer  for  which  u(t)  =  0  if  T  <  tQ.  Then  f£ 
is  clearly  a  polynomial  input /output  map  because  it  is  defined  as  a 
composition  of  polynomials.  We  may  in  fact  exhibit  \|ff   directly  by  the 
rule  that  the  coefficient  of  a  monomial  ^1...^t  should  be  equal  to 
the  coefficient  of  the  same  monomial  in  the  polynomial 
h(P(xC,  St,  •-.,  Sx)). 

Thus  we  have  defined  a  large-  class  of  systems  whose  input/output  maps 
are  polynomial.  Such  systems  are  appealing  from  both  mathematical  and 
system-theoretic  reasons,  because  they  can  be  realized  by  finite 
interconnections  of  adders,  multipliers,  amplifiers,  and  delay  lines.  In 
order  to  get  a  reasonably  complete  and  general  theory,  however,  it  is 
necessary  to  go  beyond  polynomial  systems.  A  larger  class  of  systems, 
called  k- systems,  arises  when  we  study  the  problem  of  obtaining 
"canonical"  realizations  of  input /output  maps.  The  theory  to  be 
developed  will  show  that  k- systems  provide  the  right  amount  of 


generality  for  studying  realizations  of  polynomial  maps.  We  now 
motivate  their  introduction. 

One  of  our  main  objectives  is  to  obtain  realizations  which  are 
"natural"  or  "canonical"  in  the  sense  of  not  depending  on  any  information 
not  implied  by  the  input /output  behavior.  The  class  of  candidates  to  be 
considered  should  have  some  fixed  structure  (like  polynomial  systems) 
so  that  the  canonical  system  is  recoverable  just  from  the  knowledge  of 
its  external  behavior.  The  approach  which  has  been  highly  successful 
with  automata  and  linear  systems  consists  in  trying  to  construct 
realizations  which  are  as  "minimal"  or  "irredundant"  as  possible;  see 
for  instance  KALNM,  FALB,  and  ARBIB  [1969,  Chapters  7  and  10]  and 
EILENBERG  [1974,  Chapters  3,  12,  and  16] .  We  shall  adopt  such  a 
viewpoint  here,  beginning  with  polynomial  systems,  and  we  shall  see  how 
we  are  forced  to  introduce  more  general  systems. 

Let  us  consider  the  two-dimensional  system 
lx±(t   +  1)  =  x^t)  +  u(t), 
So  »  xg(t  +  1)  =  Xl(t)x2(t)  +  X]L(t)  +  x2(t), 
(y(t)  =  x2(t), 

with  initial  state  0.  It  is  easy  to  see  that  it  is  possible  to  reach 
from  0  any  state  /  J  in  k  ,  using  in  fact  inputs  of  length  not 


greater  than  three.  There  are,  however,  redundant  states  which  behave 
identically  in  the  sense  that  they  cannot  be  distinguished  by  input/ 
output  experiments.  They  are  of  the  form  (   -  J.  Any  other  states  can 


-  ly 

be  pairwise  distinguished  from  the  data  (x  ,  x  x  +  x..  +  x?)  resulting 

of  the  observation  of  the  output  at  two  consecutive  instants.  In  order 
to  obtain  a  system  with  no  unobservable  states,  we  must  identify  the 
states  [    )  for  all  a  in  k  and  we  must  then  define  appropriate 


x-  1 
"polynomial"  transitions,  compatible  with  the  original  P,   on  the 

quotient  set  thus  obtained.  To  have  a  well-defined  notion  of  "polynomial 

map"  we  must  first  endow  our  quotient  set  with  a  suitable  notion  of 

"coordinate  system",  i.e.  we  need  to  define  in  it  a  geometric  structure. 


But  this  structure  may  not  correspond  to  a  polynomial  system. 

It  turns  out  that  the  input /output  map  f2   of  the  above  system  £ 
admits  no  observable  polynomial  realization.  This  remains  true  for  the 
weaker  question  of  existence  of  polynomial  realizations  for  which  we 
only  require  the  property  of  distinguishable  reachable  states.  In  other 
words,  it  is  in  general  impossible  to  embed  the  "Nerode  realization"  of 
an  input /output  map  in  a  polynomial  system,  even  if  f  is  the  input/ 
output  map  of  a  polynomial  system. 

The  natural  algebraic-geometric  way  to  proceed  consists  in 

introducing  the  notion  of  (k-points  of)  an  affine  k- scheme,  or,  as  we 

shall  say  for  short,  a  k- space.  Such  a  space  consists  of  a  topological 

space  X  together  with  an  algebra  of  polynomial  functions  on  X  (a 

distinguished  family  of  continuous  functions  on  X  subject  to  appropriate 

axioms).  In  particular,  the  spaces  k   become  k-spaces  when  endowed 

with  the  "Zariski  topology",  whose  closed  sets  correspond  to  subsets  of 

k   defined  by  polynomial  equations;  the  polynomial  functions  on  the 

n 
k-space  k   are  the  usual  polynomial  functions  in  n  variables.  Thus 

our  previous  choice  of  state-spaces  furnishes  an  (easy)  example  of 

k-spaces.  Given  two  k-spaces  X.,  X2  there  is  a  well-defined  concept 

of  polynomial  map  P:  X..  ->X?;  these  are  precisely  those  maps  which  when 

composed  with  the  polynomial  functions  on  X„  give  polynomial  functions 

on  X.. . 

A  k- system  T.    is  then  defined  by  letting  the  state  set  X„  be  an 
arbitrary  k-space  and  letting  the  transition  map  X_  x  k  ->Xj,  and  the 
output  function  X_  ~» k  be  polynomial  maps .  The  fundamental  observation 
is  that  the  input /output  maps  of  k- systems  are  polynomial. 

Conversely,  each  polynomial  input /output  map  can  be  realized  by  some 
k- system.  This  fact  follows  rather  trivially  once  that  k-spaces  have 
been  recognized  as  the  proper  state  spaces.  The  proof  relies  on  turning 
the  space  of  input  sequences  into  a  k-space  ft  in  such  a  way  that  the 
notion  of  (polynomial)  input/output  map  becomes  precisely  that  of  a 
polynomial  map  between  k-spaces. 


Having  established  k-systems  as  the  class  of  systems  to  be 
considered,  we  return  to  the  problem  that  motivated  the  introduction  of 
k-spaces  in  the  first  place,  namely,  the  existence  of  "observable" 
realizations. 

We  shall  prove  that  in  the  new  class  of  systems  it  is  always  possible 
to  "reduce"  a  given  system  to  one  all  of  whose  states  can  be  distinguished 
by  input /output  experiments.  Nevertheless,  this  does  not  settle  the 
question  of  observability.  It  was  noticed  already  in  SONTAG  and 
ROUCHAIEAU  [1975]  that  there  exist  input /output  maps  having  realizations 
Z  ,  Z   both  of  which  are  reachable  and  observable  but  such  that  Z   and 
Z   are  nonisomorphic  (as  k-spaces).  In  fact,  the  example  given  in  the 
above  reference  has  X,  =  k  while  X   is  a  curve  with  a  singularity, 
a  very  different  kind  of  k- space.  The  difficulty  lies  in  the  concept  of 
observability  itself.  This  notion  is  usually  defined  by  the  intuitive 
requirement  that  different  states  be  "distinguishable  by  processing  the 
input /output  data".  The  precise  notion  in  this  context  is  that  different 
states  should  be  distinguishable  by  an  algebraic  processing  of  the  input/ 
output  data.  This  point  of  view  leads  to  the  definition  of  algebraic 
observability,  which  turns  out  to  be  the  proper  notion  in  our  context. 

The  next  step  in  our  program  for  obtaining  a  "canonical"  realization 
of  a  given  input/output  map  is  to  construct  an  observable  realization  all 
of  whose  states  are  reachable  from  the  initial  state.  Here  we  run  into  a 
new  problem:  the  reachable  set  of  an  arbitrary  system  is  not  necessarily 
a  k-space.  For  instance,  let  us  consider  a  two-dimensional  system  with 
transitions  defined  by 

X;L(t  +  1)  =  u(t), 

x2(t  +  1)  =  x2(t)u(t)  +  x2(t)  +  u(t), 

and  zero  initial  state.  The  reachable  set  fails  to  contain  the  points 
W-l. 

This  difficulty  can  be  easily  eliminated.  Our  ultimate  goal  is  not 
to  obtain  reachable  and  observable  realizations  but  rather  to  construct 


and.  ze: 

(V) 


"natural"  realizations.  It  is  therefore  enough  to  observe  that  (for 
continuity  reasons),  the  dynamical  properties  of  the  reachable  part  of  a 
system  Z  uniquely  determine  the  dynamical  properties  of  the  closure 
(in  the  topology  of  the  k-space  X  )  of  the  set  of  reachable  states, 
(in  the  above  example  the  closure  corresponds  to  the  whole  plane.)  We 
shall  say  that  £  is  quasi-reachable  if  the  closure  of  the  reachable 
states  is  X„«  The  closure  of  the  subset  of  reachable  states  is  always 
a  k-space  invariant  under  the  action  of  inputs.  So  a  quasi -reachable 
realization  can  always  be  obtained  from  an  arbitrary  realization.  If  we 
begin  with  a  polynomial  system,  the  closure  of  the  reachable  set  is  a 
very  special  type  of  k-space  namely,  an  algebraic  variety.  It  is 
natural  therefore  to  generalize  our  preliminary  definition  of  polynomial 
systems  to  include  the  case  in  which  X„  is  a  variety  (not  necessarily 
k  ).  In  other  words,  a  polynomial  system  is  given  by  a  finite  set  of 
simultaneous  polynomial  difference  equations  together  with  a  set  of 
polynomial  constraints  on  the  state  variables. 

We  shall  say  that  a  k-system  is  canonical  if  it  is  quasi- reachable 
and  algebraically  observable.  One  of  the  main  results  of  this  work  is 
then:  Every  input /output  map  f  admits  a  canonical  realization  Z 
and  any  other  canonical  realization  of  f  is  isomorphic  to  £f.  We 
have  thus  attained  our  goal  of  determining  a  natural  class  of  state 
representations  for  polynomial  input /output  maps. 

The  result  on  existence  and  uniqueness  of  canonical  realizations 
must  be  complemented  by  a  discussion  of  finiteness  conditions.   In 
principle,  there  is  of  course  no  guarantee  that  the  state-space  X-  of 
£„  is  in  any  sense  "finite  dimensional". 

We  have  chosen  the  "transcendence  degree"  notion  of  dimension  out  of 
the  many  possible  definitions  of  dimension  of  k-spaces.  The  dimension 
of  a  system  Z  is  then  the  dimension  of  X  .  Informally,  the  dimension 
of  Z  counts  the  "degrees  of  freedom"  in  the  state  space.  In  the 
particular  case  of  polynomial  systems  the  dimension  is  what  one  would 
intuitively  expect.  For  instance,  if  X„  is  the  "cusp" 
{{xv   x2)  G  k2  |  xj  =  x2},  then  dim  E  =  1. 


We  shall  say  that  a  given  system  Z  is  almost  polynomial  when  X 
can  "be  obtained  as  a  "quotient"  of  some  space  k   (the  terminology 
"quotient"  is  not  quite  precise  here,  since  we  shall  have  to  admit  in 
general  the  existence  of  some  points  besides  those  representing  the 
equivalence  classes  of  points  of  k  ).  The  name  "almost  polynomial" 
is  due  to  the  fact  that  in  this  case  X„  can  he  expressed  as  a  union  of 
a  variety  and  a  lower-dimensional  subset. 

A   central  contribution  of  our  work  is:  The  input /output  map  f  has 
a  finite-dimensional  realization  if  and  only  if  Z   is  an  almost- 
polynomial  system  if  and  only  if  f  satisfies  an  algebraic  difference 
equation,  i.e.,  if  and  only  if  there  exists  an  integer  s  and  a 
polynomial  E  in  2s  +  1  variables  such  that 

E(y(t),  y(t  -  1),  ...,  y(t  -  s),  u(t  -  l),  ...,  u(t  -  s))  =  0, 

for  all  input /output  pairs  u(-)>  y( * ) •  Up  to  constant  multiples  there 
is  a  unique  irreducible  equation  E  =  0  of  minimal  order  s  satisfied 
by  f.  We  shall  also  prove  that,  if  f  satisfies  some  algebraic 
difference  equation,  then  f  satisfies  as  well  an  equation  f   =  0 
linear  in  y(t). 

As   a  simple  illustration  of  the  above  results,  let  f  :=  fr  ,  where 
Z   is  the  system,  introduced  before, 


fx^t  +  1)  =  xx(t)  +  u(t), 
Zq  =  |xg(t  +  1)  =  X;L(t)x2(t)  +  Xl(t)  +  x2(t), 


|y(t)  -  x2(t), 

with  zero  initial  state.  The  canonical  state-space  X   is,  as  a  set, 

2 
the  union  of  the  singleton  {*)  and  the  subset  U  :=  {x  /  -  l)  of  k  . 

The  transition  and  output  maps  of  Z„  are  those  induced  by  the  projection 

T:  k  -^X_:  x  .->x  if  x  is  in  U,  x  ->  *  if  x  =  -  1.   (These  are 

polynomial  maps  for  a  suitable  k-space  structure  on  X„.)  The  irreducible 

equation  of  minimal  order  satisfied  by  f  is 


[y(t  -  2)  +  l][y(t)  +  1]  -  [y(t  -  l)  +  if  - 

-  [y(t  -  1)  +  l][y(t  -  2)  +  l]u(t  -  2)  =  0. 

In  the  "classical"  case  of  linear  systems  it  is  well  known  that  a 
system  is  canonical  iff  it  is  a  minimal- dimensional  realization  of  its 
input/output  map.  This  result  does  not  generalize  directly  to  the 
present  situation.  A  counterexample  is  given  by  the  system  (with 
X  =  k1) 

lx(t  +  1)  =  u(t),  x8  =  0, 

Z~   |y(t)  =x2(t). 

Clearly,  E  is  not  canonical,  because  all  pairs  of  states   {a,  -  a] 

are  indistinguishable.  However  E  is  minimal,  since  it  has  dimension  1. 

The  proper  treatment  of  the  above  minimality  question  is  through  the 
concept  of  weakly  canonical  realizations.  We  shall  say  that  E  is 
weakly  canonical  when  it  is  quasi- reachable  and  (in  a  sense  to  be  made 
precise)  "almost  all"  states  are  indistinguishable  of  only  finitely 
many  other  states.  The  example  in  the  previous  paragraph  is  therefore 
weakly  canonical,  since  in  fact  each  state  is  indistinguishable  of  only 
one  other  state.  Let  k  be  either  the  field  of  real  numbers  or  an 
algebraically  closed  field.  We  prove  that  a  realization  E  of  a 
polynomial  input /output  map  f  is  of  minimal  dimension  among  all 
realizations  of  f  if  and  only  if  2  is  weakly  canonical.  Over  any 
field  k,  canonical  realizations  are  minimal. 

The  question  of  deciding  when  E   is  in  fact  a  polynomial  system 
(i.e.  X_  is  a  special  kind  of  k- space:  a  variety)  can  be  answered 
theoretically  via  the  introduction  of  the  observation  algebra  Af  of 
the  input /output  map.  This  is  a  k-algebra  which  is  canonically 
associated  to  any  given  f .  We  prove  that  E   is  a  polynomial  system 
precisely  when  A„  is  finitely  generated  as  a  k-algebra.  Further,  the 
smallest  n  for  which  E   can  be  embedded  in  a  system  of  n 
simultaneous  polynomial  difference  equations  is  equal  to  the  minimal 
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Lon  of  the  type 

a(u(t 

-  1),  ...,  u(t  -  J 

0)y(t)r 

+  E  =  0, 

where  y(t)  appears  in  E  with  degree  less  than  r.  Thus  if,  for 
instance,  f  is  known  to  satisfy  a  regression  equation,  the  realization 
theory  of  f  can  in  principle  be  carried  out  without  introducing  the 
concept  of  k-spaces.  Even  in  this  special  case,  however,  the  general 
theory  is  needed  in  order  to  understand  the  meaning  of  the  special 
hypothesis . 

One  of  the  main  results  is  valid  for  input /output  maps  defined  over 
fields  k  which  contain  the  rational  numbers.  The  result  states  that 
f  has  a  finite  realization  if  and  only  if  the  Jacobian  matrices  in  a 
certain  sequence  J, (f),  J?(f),  ...  have  a  uniformly  bounded  rank.  For 
a  trivial  example,  we  point  out  that  when  f  is  linear  the  matrix 
J  (f)  is  precisely  the  n-th  principal  minor  of  the  behavior  (Hankel) 
matrix  of  f . 

All  the  results  presented  up  to  this  point  are  proved  later  for 
mult ivari able  polynomial  input /output  maps,  for  which  both  the  inputs 
and  outputs  are  vector- valued . 

Proofs  of  the  preceding  results  use  tools  of  algebraic  geometry.  In 
other  words,  we  use  the  "theory  of  polynomials"  in  the  study  of  arbitrary 
polynomial  input /output  maps. 

The  second  part  of  this  work  deals  with  a  broad  class  of  bounded 
(polynomial)  input /output  maps,  whose  study  can  be  "linearized".  This 
linearization  permits  us  to  obtain  sharper  statements.  The  isolation 
of  the  class  of  bounded  maps  is  one  of  the  contributions  of  this  work. 

Bounded  maps  f  are  defined  as  follows.  Recall  that  if„  has  a 
finite  degree  d .  in  each  variable  E,  . .  We  say  that  f  is  bounded  when 
the  degrees  d.  are  bounded  independently  of  j.  In  other  words,  there 

(J 
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exists  an  integer  d  such  that  no  input  is  raised  to  a  power  higher  than 
d.  There  are  no  restrictions  on  products  between  inputs  at  different 
instants  and/or  different  channels.  It  is  at  first  surprising  that  the 
concept  of  bounded  map  includes  as  particular  cases  all  those  families 
of  maps  for  which  a  satisfactory  realization  theory  has  been  developed 
in  the  past.  For  instance,  linear  systems,  internally-bilinear  systems 
(BROCKETT  [1972],  ISIDORE  and  RUBERTI  [19731,  ISIDORI  [1973,  197*+], 
FLIESS  [1973,  1975],  D'ALLESSANDRO,  ISIDORI,  and  RUBERTI  [197*0,  and 
others)  give  rise  to  bounded  maps,   (internally-bilinear  systems  are 
those  whose  internal  map  is  bilinear  in  the  state  and  input  and  whose 
output  map  is  linear.  Wo  products  of  inputs  at  same  instants  are 
performed,  by  such  systems,  so  d  :=  1  bounds  all  d..)  Multilinear 
input /output  maps  (KAIMAN  [ 1968,  1976])  are  also  included.   (Such  maps 
allow  products  of  inputs  only  in  different  channels,  so  that  d  :=  1 
is  again  a  bound.) 

We  prove  that  if  a  bounded  input /output  map  is  at  all  realizable  by 
a  finite  dimensional  k-systemt  then  it  is  also  realizable  by  an 
(observable)  state-affine  system.  The  latter  are  (polynomial)  systems 
with  X  =  kn  whose  defining  equations  take  the  special  form 

x(t  +  1)  =  F(u(t))x(t)  +  G(u(t)),   x»  =  0, 

y(t)  -  Hx(t). 

where  F(-)  and  G( • )  are  polynomial  matrices  and  H  is  a  linear  map. 

The  characteristic  feature  of  state-affine  systems  is  the  linear 

occurrence  of  the  state  variable. 

The  above  readability  result  establishes  state-affine  systems  as  a 

very  useful  and  natural  class  of  systems  with  respect  to  bounded  maps. 

Input/output  maps  realizable  by  state-affine  systems  (equivalents, 
finitely  realizable  bounded  maps)  are  precisely  those  whose  "observation 
space"  (a  linear  space  directly  associated  to  the  map)  is  finite- 
dimensional.   These  and  other  results  indicate  that  state-affine  systems 
play  an  "approximation"  role  in  the  discrete  theory  similar  to  the  role 
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We  then  restrict  our  attention  to  realizations  by  state-affine 
systems.  Canonical  realizations  can  now  be  obtained  (for  bounded  maps) 
without  recourse  to  k-spaces.  In  fact,  it  is  now  natural  to  define 
span-canonical  state-affine  systems  as  observable  systems  such  that  the 
linear  span  of  the  reachable  states  is  the  full  state- space  k  .  We 
then  prove  that  span-canonical  realizations  of  a  given  bounded  finitely 
realizable  f  always  exist.  Further,  any  two  such  realizations  can  be 
related  by  a  linear  change  of  coordinates  in  the  state-space.  Finally, 
a  realization  is  span  canonical  if  and  only  if  its  dimension  n  is 
smallest  possible  among  all  state-affine  realizations  of  the _g^me_input/ 
output  map. 

The  above-mentioned  results  are  proved  by  first  associating  to  the 
bounded  map  f  the  exponent  formal  power  series  cp   obtained  directly 
from  the  Vol  terra  series  \|r_.  As  opposed  to  |f,   the  exponent  series  is 
a  power  series  in  noncommutative  variables.   The  transformation  tf  •-»  cpf 
permits  the  explicit  consideration  of  dynamics.  We  then  remark  that 
state-linear  realizations  are  in  a  one-to-one  correspondence  with 
representations  of  cp  .   (The  concept  of  representation  of  a 
noncommutative  power  series  was  introduced  by  SCHUTZENBERGER  [  1961J  as  a 
generalization  of  automata-theoretic  ideas,  and  has  been  rediscovered 
since  by  many  authors,  notably  in  the  context  of  stochastic  automata. 
Representations  have  been  called  sequential  systems  by  CARLYLE  and  PAZ 
[1971]  and  linear-space  automata  by  TURAKAINEN  [1972].  A  fairly  complete 
account  of  representations,  also  called  "automata  with  multiplicities", 
may  be  found  in  EILENBERG  [191k].     The  notion  of  representation  which  we 
use  is  in  fact  a  minor  variation  of  that  in  the  literature.)  The  idea 
of  associating  representations  to  systems  is  not  totally  original,  since 
an  analogous  method  was  used  by  FLIESS  [ 1975]  to  study  the  special  case 
of  internally-bilinear  systems.  We  give  a  brief  but  self-contained 
exposition  of  those  results  on  representations  which  are  relevant  to  our 
work. 
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An  interesting  observation  is  that,  under  the  above  one-to-one 
correspondence,  span-reachability  a.nd  observability  for  state-affine 
systems  corresponds  precisely  to  (automata- theoretic)  reachability  and 
observability  for  representations.  Realizability  of  f  can  then  be 
studied  via  the  behavior  (Hankel)  matrix  B_(f)  of  cp  ,   and  minimal 
state-affine  realizations  can  be  obtained  by  operating  on  B(f),  using 
the  methods  developed  for  representations  by  FLIESS  [1972,  1975] •  We 
give  such  a  realization  procedure,  which  generalizes  and  unifies  known 
algorithms  for  linear  and  for  the  various  kinds  of  bilinear  systems.  An 
interpretation  of  B_(f)  follows  from  the  remark  that  the  observation 
space  of  f  is  isomorphic  to  the  row  space  of  B(f). 

We  sharpen  the  result  on  algebraic  difference  equations  by  proving 
that  a  bounded  map  is  finitely  realizable  if  and  only  if  it  satisfies 
an  input /output  difference  equation  which  is  linear  in  the  output.  This 
is  a  new  result  even  in  the  (very  special)  cases  of  internally-bilinear 
systems  and  multilinear  input /output  maps. 

SCHUTZENBERGER  [  196l]  gave  a  generalization  to  power  series  of 
Kleene's  theorem:   A  language  L  is  recognizable  by  a  finite  automaton 
if  and  only  if  L  can  be  described  by  a  regular  expression.  This 
generalization  can  be  applied  to  cp   via  the  above  correspondence 
between  state-affine  systems  and  representations.  The  conclusion  is 
that  f  has  a  state-affine  realization  if  and  only  if  cp   is  rational 
i.e.,  if  and  only  if  (p-  can  be  obtained  from  polynomials  by  a  finite 
number  of  additions,  multiplications,  and  inversions.  As  a  consequence, 
it  becomes  possible  to  apply  the  standard  calculus  of  interconnections 
of  automata  (see,  for  instance,  EILENBERG  [ 197U] )  to  find  cpf  (and 
therefore  \|r_)  from  any  state-affine  realization  of  f,   and,  viceversa, 
to  construct  realizations  given  rational  expressions  for  cp.,. 

We  also  define  the  subclass  of  finite  maps  f ,   corresponding  to  the 
restriction  that  the  total  degree  of  tf  should  be  finite.  We  show 
that  the  span  canonical  realization  of  such  maps  can  be  decomposed  as  a 
cascade  of  linear  systems  and  memory- free  nonlinearities .  The  existence 
of  such  decompositions  characterizes  finite  maps. 
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of  nonlinear  systems,  the  present  work  is  no  exception  to  that  hope. 

One  of  the  byproducts  of  an  algebraic  study  of  systems  is  of  course 
the  development  of  algorithms  for  system  analysis  and  design.  In  the 
case  of  bounded  maps  and  state-affine  systems,  we  use  linear-algebraic 
techniques  in  constructing  canonical  realizations;  these  methods  are  a 
rather  simple  generalization  of  the  classical  Hankel  matrix  technique 
used  so  successfully  in  linear  system  theory.  Finite  dimensionality  of 
the  observation  space  is  responsible  for  the  linear-algebraic  character 
of  the  study  of  bounded  maps .  This  means  that  a  nonlinear  computational 
technique  is  indispensable  as  soon  as  nonlinear  feedback  is  present  in  a 
system.  An  important  question  is,  then:  How  effective  are  calculations 
with  fields,  algebras  and  polynomials?  From  its  origins  until 
(historically)  not  long  ago,  algebra  remained  to  a  great  extent  a 
computational  discipline.   The  development  of  "modern"  algebra  [or  the 
modern  development  of  algebra]  has  shifted  the  emphasis  towards 
generality  and  abstraction,  permitting  both  the  solution  of  heretofore 
unsolvable  problems  and  the  understanding  of  deep  questions  which  can 
only  now  be  even  formulated  in  a  rigorous  way.  Many  questions  of 
effective  calculation  have  thus  been  left  aside  of  the  mainstream  of 
algebra;  a  development  which  is  particularly  unfortunate  in  view  of  the 
advent  of  the  digital  computer.  However,  there  are  now  signs  of  a  trend 
toward  the  effectivization  of  various  basic  algebraic  constructions. 
Some  of  these  constructions  can  be  used  to  solve  system- theoretic  questions. 
For  instance,  SEIDENBERG  [1971]  has  worked  on  effective  versions  of  Hilbert's 
Basis  Theorem,  and  his  results  find  an  immediate  application  to  questions  of 
observability  (SONTAG  and  ROUCHALEAU  [1975])-  The  posthumous  work  of 
ROBINSON  [19753  (see  also  CROSSEEY  and  NERODE  [1975])  represents  a 
promising  approach  to  questions  of  computability  in  algebra,  attacking  such 
questions  from  the  point  of  view  of  mathematical  logic  (model  theory),  but 
most  of  the  detailed  work  remains  to  be  done. 

Of  course,  there  is  a  large  number  of  classical  results,  dealing  with 
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resultants  and  derivatives,  and  sometimes  referred  to  by  the  label 
elimination  theory,  which  permit  the  effective  verification  of  certain 
conditions;  our  Jacobian  criterion  for  finite-dimensional  realizability 
involves  a  simple  application  of  such  results.  It  would  certainly  be 
of  interest  to  explicit ely  compute  the  form  of  similar  criteria  for 
other  problems;  for  instance,  quasi-reachability  of  a  polynomial  system 
is  also  expressible  by  a  Jacobian  condition. 

Many  theoretical  algebraic  problems  are  suggested  by  the  present 
work.  For  instance,  as  a  rule  system-theoretic  questions  depend  for 
their  clarification  upon  the  development  of  a  real,  (as  opposed  to  complex) 
algebraic  geometry.  The  papers  of  WHITNEY  [1957]  and  of  DUBOIS  and 
EFROYMSON  [1970]  are  among  the  few  works  in  this  area.   In  fact,  the  study 
of  points  in  more  arbitrary  fields  (e.  g.,  the  rational  numbers)  is  needed 
from  our  viewpoint.  For  instance,  the  question  of  the  validity,  over 
k  ^  reals  or  k  not  algebraically  closed,  of  the  theorem:   "A  realization 
is  minimal  if  and  only  if  it  is  weakly  canonical'7,  leads  to  (unsolved) 
problems  of  an  arithmetic,  rather  than  geometric,  nature.  Another  "pure" 
algebraic  problem  generated  by  our  research  is  that  of  studying  the 
structure  of  the  "almost  varieties"  or  "quotients  of  varieties"  which 
appear  as  our  canonical  state-spaces.  A  thorough  understanding  of  such 
spaces  is  an  essential  prerequisite  to  the  practical  application  of  the 
most  general  of  our  results,  i.  e.,  those  dealing  with  the  case  in  which 
Z   is  not  a  polynomial  system. 

There  are  many  problems  which  probably  do  not  require  finding  new 
algebraic  results.  A  typical  open  question  of  this  type  is:  If  k  is 
algebraically  closed,  is  the  reachable  part-  of  every  polynomial  system 
actually  reachable  in  bounded  time? 

Insofar  as  we  have  attacked  the  realization  problem  using  methods  not 
standard  in  system  theory,  there  arises  the  possibility  of  applying  the 
same  methods  to  the  study  of  other  system-theoretic  questions.  Some 
steps  toward  this  broader  goal  were  taken  in  SONTAG  and  ROUCHALEAU  [1975], 
but  here  again  most  of  the  work  remains  to  be  done. 


CHAPTER  II.   ALGEBRAIC  PRELIMINARIES 

In  this  chapter  we  shall  briefly  discuss  soir.e  basic  notions  of 
algebraic  geometry  which  are  used  in  the  sequel.  The  main  object  to  be 
introduced  is  the  set  of  k-points  of  an  affine  k- scheme  (k  =  field); 
we  shall  simply  call  this  object  a  ,rk-space". 

The  study  of  k- spaces  is  per  se  not  included  in  standard  texts  in 
algebraic  geometry;  usually  one  studies  instead  the  set  of  all  points  of 
a  scheme  and  then  tries  to  deduce  special  properties  of  the  k-points. 
For  instance,  the  study  of  finitely  generated  reduced  schemes  over  the 
reals  R,   i.e.  the  study  of  solutions  of  polynomial  equations  with  real 
coefficients 

(*)     Pi(x1,  ...,  xn)  =  0,   i  =  1,  ...,  r, 

focuses  on  the  complex  solutions  (x, ,  .  ..,  x  )  in  C   of  (*)  rather 
than  on  the  real  solutions.   This  approach  has  proved  highly  appealing, 
since  statements  concerning  the  set  of  complex  solutions  do  not  have  to 
elucidate  certain  exceptional  or  degenerate  cases.  In  fact,  it  is 
customary  to  proceed  a  step  further  and  embed  the  corresponding  problem 
in  projective  space. 

To  infer  the  nature  of  the  set  of  k-points  from  the  properties  of 
the  entire  scheme  is  not  always  a  straightforward  matter;  it  may  involve, 
in  fact,  nonalgebraic  (e.g.  differential-geometric)  arguments. 

For  purposes  of  this  exposition  we  have  adopted  the  procedure  of 
defining  k- spaces  directly.  We  shall  give  here  the  definitions  and  the 
main  results  needed  later.  With  the  exception  of  some  trivial  statements, 
no  proofs  will  be  given  for  those  facts  for  which  a  precise  reference  is 
available  (and  given).   There  is  unfortunately  no   single  source  for  the 
results  quoted.  We  rely  mainly  on  BOURBAKI  [19721  and  DIEUDONNE  [  197'-^  • 
Except  for  some  matters  of  style  and  emphasis,  no  original  contributions 
appear  in  this  chapter. 
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1.   It-Reduced  Algebras. 

Let  k  denote  an  arbitrary  but  infinite  field,  to  be  fixed 
throughout  the  discussion.   Recall  that  a  (commutative)  k- algebra  (or, 
simply,  an  algebra)  is  a  pair  (A,  ©),  where  A  is  a  commutative  ring 
with  identity  and  ep:  k  -»A  is  a  ring  homomorphism  with  cp(l)  =  1.  We 
shall  denote  such  algebras  by  the  corresponding  ring  A  and  identify 
k  with  its  (isomorphic)  image  cp(k) .   Thus  the  scalar  product  r.a, 
r  €  k,   a  £  A,   is  the  multiplication  in  A .   The  field  k  may  always 
be  viewed  as  a  k-algebra,  with  cp  =  identity. 

A  homomorphism  of  k-algebras   u:  A  -» B  will  mean  a  homomorphism 
whose  restriction  to  cp(k)  =  k  is  the  identity. 

We  adopt  the  following  notation  conventions: 

(i)  the  first  few  upper-case  Latin  letters  A,  B,  C,  ... 
denote  k-algebras; 

(ii)  A  8)  B  or  simply  A  ®  B  is  the  tensor  product  algebra; 

(iii)   if  A   is  an  integral  domain,  then  Q(A)  denotes  the 
quotient  field  of  A; 

(iv)  k[T  ,  ...,  T  1  denotes  the  ring  of  polynomials  in  r 
variables  over  k;  when  r  =  1  we  write  simply  k[Tn; 

(v)   "homomorphism"  will  always  mean  k-algebra  homomorphism; 

(vi)  Horn  (A,  B)   denotes  the  set  of  all  homomorphisms  A  ->B. 

(1.1)   DEFINITION.  A  k- ideal  M  of  a  k-algebra  A  is  the  kernel  of 
a  homomorphism  A-k.   The  k-radical  rad^  A  of  A  is  the  intersection 
of  all  k-ideals  of  A . 

Let  M  be  a  k- ideal  of  A.   Since  A   is  a  k-algebra,   A  M  *  k  is 
a  field,  so  M  is  maximal,  but  not  every  maximal  ideal  of  A  is  a 
k-ideal.   For  instance,  let  k  =  R  and  A=R[Tl.   Then  the  ideal  M 
generated  by  x2  +  1  is  maximal  (because  I   +1  is  an  irreducible 
polynomial)  but  A /to  *  C  4  R.   In  the  particular  case  in  which  A  is 
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finitely  generated  and  k  is  algebraically  closed,  all  maximal  ideals 
are  k-ideals;  this  is  a  consequence  of  Hilbert's  Nullstellensatz;  see 
BOURBAKI  [1972,  V.3.3,  Proposition  2]. 

There  is  a  bijective  correspondence  between  k-ideals  and 
homomorphisms  A  -» k.  Indeed,  let  u,  V:  A  -»  k  and  suppose  that 
ker  |i  =  ker  v.  Take  any  x  in  A.  Since  u(x)  is  in  k  CA  and 
M  is  the  identity  on  k,   x  -  u(x)  belongs  to  ker  u  =  ker  v.  Thus 
0  =  v(x  -  u(x))  =  v(x)  -  u(x).  So  the  maps  u  and  v  are  equal. 

Let  X  be  a  set.   The  set  k   of  all  functions  X  -> k  is  a 
k-algebra  under  the  pointwise  operations,  the  constant  functions 

y 

constituting  the  subring  isomorphic  to  k.  A  subalgebra  of  k   is 
called  an  algebra  of  functions  on  X. 

(1.2)   LEMMA.   The  following  statements  are  equivalent: 

(a)  rad^.  A  -  {0}. 

(b)  A  is  isomorphic  to  an  algebra  of  functions. 

PROOF,   (b)  implies  (a).  Let  A     be  identified  with  a  subalgebra 

X 
of  k  .  For  each  x  in  X,  the  evaluation  map 

e  :  k  ->  k:  cp  ■->  cp(x), 

restricted  to  A  is  a  homomorphism,  hence  ker  ex|A  is  a  k- ideal. 
Clearly  then  ra<L_  Ac  H  (ker  e  ,  x  in  X)  =  (0). 

(a)  implies  (b).   Let  X(A)  denote  the  set  of  all  homomorphisms 
u:  A  ->  k.   Define  i:   A  ->  k  ^    by  evaluation: 

i(a)(u)  :=  u(a). 

Then  t  is  a  homomorphism;  moreover,  it  is  in  fact  one-to-one.  To 
prove  this,  assume  that   t(a)  =  0,   i.e.   t(a)(u)  =  u(a)  =  0  for  some 
a  in  A  and  for  all  u  in  X(A) .  Then  a  is  in  the  kernel  of  every 
u:  A  ->k,   i.e.  in  the  k-radical  of  A,  which  is  0  by  hypothesis. 
So  a  =  0.  Therefore  A  ~  t(A).  D 
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Let  us  observe  the  duality  implicit  in  the  preceeding  arguments. 
Homomorphisms  A  — -  k  may  he  viewed  as  points  on  which  elements  of  A 
act  by  evaluation. 

This  duality  is  fundamental  in  algebraic  geometry . 

(1.3)   DEFINITION.  An  algebra  satisfying  the  conditions  in  (1.2)  is 
called  a  k- reduced  algebra . 

For  every  k- algebra  A,   the  quotient  ring  A/rad^  A  is 
k-reduced. 

An  important  fact  related  to  (1.3)  is  Hilberts's  Nullstellensatz, 

which  can  be  phrased  as  follows:  A  finitely  generated  algebra  A  over 

an  algebraically  closed  field  k  i_s  k-reduced  if  and  only  if  A  has 
no  nonzero  nilpotents . 

A  recent  generalization  of  this  celebrated  result  (DUBOIS  [1967]", 

DUBOIS  and  EFROYMSON  [1970])  is  the  following:  A  finitely  generated 

algebra  A  over  a  maximally  ordered  field  k  (e.g.  k  =  R)  is 

'  vn    2 
k-reduced  if  and  only  if  for  any  x.   in  A  the  relation  1,    xi  =  0 

implies  x.  =  0  for  all  i. 

The  main  idea  in  what  follows  is  to  view  the  elements  of  a 
k-reduced  algebra  A  as  functions  on  X(A).   Take  x  in  X(A),   a  in  A. 
Viewed  as  a  function  t(a)  on  X(A),   a  has  the  value  t(a)(x)  =  x(a) 
at  x.  Except  when  discussing  certain  delicate  points.,  we  shall  therefore 
identify  A  with  its  image  under  1     and  so  we  shall  write  i(a)  as  a 
and  t(a)(x)  as  a(x).  The  fact  that  x  is  a  homomorphism  means  that 
the  algebra  operations  in  A  are  now  represented  as  pointwise  operations: 
(ab)(x)  =  a(x)b(x). 

It  is  worth  keeping  in  mind  the  following: 

(l.k)       EXAMPLE.   Let  A  :=  k[T-,  ...,  T  ] .   Since  k  is  infinite,  a 
polynomial  a(T  ,  ....  T  )  can  be  identified  with  the  polynomial  function 
kn  ->k:  (x  ,  ...,  x  )  H»a(x-,  . ..,  xn) .   Thus,  by  (1.2).  A  is  k-reduced. 
A  homomorphism  U:  A  -->  k  is  completely  determined  by  giving  values  of 


20 


H(T  ),  ...,  u(T  )   in  k.   Conversely,  for  any  choice  of  (x, x  ) 

n        n  In 

in  k   there  is  a  homomorphism  u:  A  -» k  defined  by  |i(T.)  =  x..   Thus 

n  X(A)        1     1 

one  identifies  X(A)  with  k   and  (,:  A  -» k     with  the  assignment: 

polynomial  «->  polynomial  function.  U 


(1.5)   LEMMA  (canonical  factorizations).   Let  1 : 
k- reduced. 

(i)  Then  there  exists  a  factorization 


where  B  is 


where  u  is  onto,   v  is  one-to-one,   and  C  is 


k- reduced. 

.  ~   v 


(ii)  Further,  let  two  factorizations 


; 


B  of  t  be  given,  with  u.   onto  and 


— ►  B  and 
one-to-one. 


Then  there  exists  a  unique  r\:   C.  -»  Cp  such  that  the  following  diagram 
commutes : 


PROOF.   The  existence  of  factorizations   (u,  C,  v)   and  of  the 
map  T)  are  elementary  algebraic  facts.  We  must  only  prove  that  any 
such  C  is  k-reduced.   But  C  -  v(C)  C  B,   and  B  is  an  algebra  of 
functions.  So  C  is  also  an  algebra  of  functions.  □ 

The  above  lemma  plays  an  important  role  in  characterizing  canonical 
realizations,  analogous  to  the  role  of  its  linear  variant,  Zeiger's 
lemma  (KALMAN,  FALB,  and  ARBIB  [ 1969,  Chapter  10,  Lemma  (6.2)]),  in 
linear  system  theory. 


(1.6)   REMARK.   Products  of  k-reduced  algebras  are  k-reduced.   Indeed, 

X  ■ 
assume  that  A.   is  a  subalgebra  of  k  0   for  each  j   in  J.   Then 

J 
II  A   is  an  algebra  of  functions  on  the  disjoint  union  of  the  X.  •       □ 
J  J  '  J 


It  follows  from  the  definition  of  the  tensor  product.  ®  that  for 
any  two  homomorphisms  m:  A  ->  C  and  v:  B  ->  C  there  exists  a  unique 
homomorphism 

(1.7)  U®  V:  A  ®  B  -»C:  Z  a.  ®  b.  -->£  u(a.)v(b.). 

(1.8)  LEMMA.   Let  A,  B,  C  be  k- algebras.   Then: 

(a)  The  assignment  (u,  v)  -»  U  ®  V  establishes  a  bisection 
between  Horn  (A,  C)  x  Horn  (B,  C)  and  Horn  (A  ®  B,  C). 

(b)  X(A  ®  B)  is  naturally  identified  with  X(A)  X  X(B). 

(c)  If  A,  B  are  k-reduced,  so  is  A  0  B. 

PROOF,   (a)  Write  j'.   A  ->  A  ®  B:  a  h>  a  ®  1  and 
j  :  B  ->  A  ®  B:  b  *-->  1  ®  b.  We  then  define  the  inverse  of  the 
assignment  (u,  v)  *->  u  ®  v  as  Horn  (A  ®  B,  C)  -.Horn  (A,  C)  x 
x  Horn  (B,  C):  7  *■*  (7°^,  7°dg)- 

(b)  Apply  (a)  to  C  :=  k. 

(c)  Since  X(A  ®  B)  has  been  identified  to  X(A)  X  X(b),   it 
is  enough  to  prove  the  following:   if  (u,  v)(c)  =  0  for  some  c  in 

A  ®  B  and  for  all  (u,  v)  in  X(A)  X  X(b).   then  c  --,  0.  Assume  that 
c  4   °>   and-  express  c  as  a  finite  sum  Z  a.  ®  b.  with  the  b. 
linearly  independent  over  k.  For  any  V     in  X(A),   consider  the 
element  b  :=Z  u(a.)b.   of  B.   For  any  V  in  X(b), 
v(b)  =  Z  u(a. )v(b.)  -  (u,  v)(c)  =  0.   Since  B  is  k~ reduced,  the 
function  b  =  0.  The  b.  being  linearly  independent,  it  follows  that 
u(a.)  =  0  for  all  i.  Since  A  is  k-reduced  and  u  was  arbitrary, 

a.  =  0  for  all  i.  So  c  -  0. 

1 


(1.9)   REMARK.   Recall  that,  in  any  category,  the  coproduct  of  a 

family   {IL  ,  A  G  A}  of  objects  is  defined  as  an  object  T  -   II  T   and 

morphisms  U.:  T '  -»  T  which  satisfy:   for  any  object  9  and  morphisms 

q   .  <v     ^9  there  exists  a  unique  morphism  v>:    T  -»  0  such  that 

A   A 
uo^Ok  =  9-,   for  all  A.  An  equivalent  way  of  expressing  the  properties 

(1.8. a)  and  (l.8.c)  is  to  say  that  A  ®  B  (together  with  the  inclusions 
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a  h  a  3  1,  bnlgb)  is  the  eoproduct  of  A  and  B  in  the  category 

red 
Alg     of  k- reduced  algebras,  having  (k-algebra)  homomorphisms  as  its 

morphisms. 

By  induction,  the  eoproduct  of  any  finite  family  A  ,  ....  A 

red  1       n 

in  Alg     is  A.  8  ...  8  J!  .   In  the  case  of  the  category  Alg   of 

all  k- algebras  (as  in  all  algebraic  categories),  an  arbitrary  (not 

finite)  eoproduct  exists .   In  fact,  this  eoproduct  can  be  obtained  by 

the  following  direct-limit  construction.   Consider  first  the  disjoint 

union  of  all  [&V  A,,  ADA  =  finite}.   Then,  for  each  pair  A  cA', 
Ao  A     —  o  o  —  o 

identify  ®A  A,   with  the  image  of  the  inclusion  morphism 
A0   a 

®.   A,  -40,,  A,   obtained  by  adding  coordinates  equal  to  1  in  the 
Ao  ^    Ao  A 
positions  A  in  A'  -  A  .   The  algebra  A  obtained  from  this 
o    o 

construction  is  the  eoproduct  of  the  A-.   in  Alg  .  If  all  A^  are 

k- reduced,  so  is  A.   Indeed,  by  the  construction  just  sketched,  an 

element  x  of  A  is  in  some  (finite)  tensor  product  ®.  A..,   and 

Ao  A 
every  homomorphism  ®.  A,  -^  k  extends  to  a  homomorphism  A  -* k  (just 

define  A,  ~» k  arbitrarily  if  A  is  not  in  A  ).  Assume  now  that  x 

is  in  the  kernel  of  all  homomorphisms  A  -> k.  Then  x  is  in  the  kernel 

of  all  homomorphisms  ^  A,  -»  k,   so  by  (2.8.c),   x  =  0.  Therefore  A 

1  o  ^ 
is  k-reduced,  and  a  fortriori  A  is  the  eoproduct  of  the  A^   in 

red 

Alg   .   Thus  arbitrary  eoproduct s  exist  in  the  category  of  k-reduced 

algebras .  Moreover,  the  construction  shows  that  A  -  II  A,   includes  all 
the  A,  ,  and  that  A  is  generated  by  the  k- algebras  A..  .   Finally, 
observe  that  the  categorical,  definition  of  eoproduct  given  above,  applied 
to  0  :=  k,   shows  that  the  set  X(A)  of  morphisms  A  ->  k  is  identified, 
through  composition  with  the  inclusion  homomorphisms  A,  -4  A,  with  the 
set  II  X(A-)  of  families  of  homomorphisms  A,  ~>  k. 

(1.10)  DEFINITION.   Let  A  be  an  algebra  of  functions  on   a  set  X. 
Take  x,  y  j_n  X.  Then  A  separates  x  and  y  iff  there  exists  an  a 
in  A  such  that  a(x)  5/  a(y). 

(1.11)  DEFINITION.  Let  A  be  a  subalgebra  of  the  k-reduced  algebra  B. 
Then  A  is  maximally  separating  with  respect  to  B  iff  there  is  no .  C 
such  that  A  C  C  C  B,   and  C  separates  the  same  points  of  X(B)  as  A. 


:■ 


We  shall  later  make  use  of  the  following 

(1.12)  EXAMPLE.   Let  A  :  =  k[TlJ,   n  >  0]   he  the  subalgebra  of 

k[T,,  T_J  generated  by  all  the  monomials  T  T  ,   n  >  0.   Then  A  is 
1  t±  i  d  —  — 

maximally  separating  with  respect  to  B  :=  k[Tn,  ^J  ■  Note  that  A 

identifies  all  points  of  the  form.  (0,  x  )  and  separates  any  other 

pair  of  points  in  X(B)  =  k  ,  Assume  that  there  is  a  C  as  in  (l.ll). 

Take  P(T„ ,  T,)  =E  a.  .T^T^,  in  C.   Since  C!  separates  no  more  points 
v  1'   2       ij  1  2  _     -j 

than  A,   P(0,  X  )  =  P(0,  0)   for  all  Xg.   Therefore  L   a0-jT2  is  a 
constant  polynomial.  Since  k  is  infinite.   P  has  no  terms  in  T?, 
d>0.  So  P  =  aoQ  +  .^  a. jTJ^CT^)  is  in  A.  □ 

Recall  (BOURBAKI  [1972,  Chapter  V] )  that  the  algebra  A   is 
integral  over  the  subalgebra  B  iff  every  a  in  A  satisfies  an 
equation 

a"  +  bian"1  +  ...  +  bn  =  0, 

for  some  b.,   j  =  1,  ...,  n  in  B  and  for  some  n  >  0. 

A  k-algebra  A  is  finitely  generated  iff  there  exists  a  finite 
subset   (a  ,  ...,&}     of  A  such  that  each  element  of  A  can  be 
expressed  as  a  finite  combination  of  a.,  . ...  a^  using  sums,  products, 
and  multiplication  by  elements  of  k. 

(1.13)  LEMMA.  Let  the  k-algebra  A  be  an  integral  domain,  and. assume 
that  A  is  integral  over  a  subalgebra  B. 

(a)  If  B  is  a  finitely  generated  k-algebra  and  Q(A)   is  a 
finitely  generated  field  extension  of  Q(b).   then  A  is  a  finite 
B-module  and  so  also  finitely  generated  as  a  k-algebra. 

(b)  If  A  is  a  finitely  generated  k-algebra  then  B  is  also 
finitely  generated. 

PROOF.   (a)  This  is  an  easy  consequence  of  BOURBAKI  [1972, 
V.3.2,  Theorem  2] . 


(b)  Let  an,  ....  a   generate  A.  Each  a.   satisfies  an. 
In  1 

integral  equation  with  coefficients  b.  .   in  B;   call  C  C  B  the 

k~algebra  generated  by  all  the  b. ..   Then  A  is  integral  over  C, 

and  so  B  is  integral  over  C.   By  (a),   B  is  finitely  generated.    D 


The  Zariski  Topolos 


For  the  rest  of  this  chapter,  unless  the  contrary  is  explicitely 
stated,   A,  B,  C,  ...  will  always  denote  k-reduced  algebras. 

Recall  that  X(A)  denotes  the  set  of  all  homomorphisms   |i:  A  ->  k. 

¥e  now  introduce  an  operator  V  which  assigns  a  subset  V(s)  of 
X(A)  to  each  subset  S  of  A.   It  is  defined  as 

(2.1)  V(S)  :=  (x  in  X(A)  j  a(x)  =  0  for  all  a  in  S). 

Thus  V(S)  is  the  set  of  solutions  of  the  simultaneous  equations 
a(x)  =0,   a  in  S.  Since,  by  convention  a(x)  means  x(a),  where 
x:  A  -» k  is  a  homomorphism,  we  can  give  the  equivalent  definition 

(2.2)  V(S)  :=  {x:  A  ->  k  |  ker  x  D  S}. 

(2.3)  PROPOSITION.   The  operator  V  satisfies  the  following 
properties: 

(a)  V(S)  =  X(A)  if  and  only  if  S  =  {0}. 

(b)  V(A)  -  0. 

(c)  SCT  implies  V(T)CV(S). 

(d)  Let   (S)  denote  the  ideal  of  A  generated  by  S.  Then 

v(s)  =  v({s». 

(e)  V(  U  {SA,   A  in  A})  =  H  fv(S^),   A  in  A}. 

(f)  Let  I.J  denote  the  product  of  the  ideals  I,  J  of  A, 
i.e.  the  ideal  generated  by   {ab,  a  in  I,   b  in  J).   Then 

V(I)UV(J)  =  V(lOj)  =  V(I.J). 
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PROOF,   (a),  (b),  (c),  (d),   and  (e)  are  easy  consequences  of 
the  definition  of  V. 

We  now  prove  (f).   By  (c),   iflJCI  and  iOjCJ  imply 
V(I)  U  V( J)  C  V(I  n  J)  .   Similarly,   I.JCI  Pi  J  implies 
V(lHj)  CV(I.J).  Consequently,  it  will  be  sufficient  to  show  that 
if  x  is  not  in  V(l) U  v(j)  then  x  is  not  in  V(l.J).  But  if  x 
belongs  to  neither  V(l)  nor     V"(j),   there  are  a  in  I,  b  in  J 
such  that  a(x)  ^  0,  b(x)  ^  0.  So  (a.b)(x)  -  a(x)b(x)  /  0.  Since 
ab  £  I.J,  x  is  not  in  V(l.j). 

It  follows  from  (a),  (b),  (e),  and  (f)  above  that   {V(s),  SC.1) 
is  the  family  of  closed  sets  for  a  topology  on  X(A),   called  the 
Zariski  topology.  Therefore  we  shall  henceforth  refer  to  sets  of  the 
type  V(S),  S  C  A,  as  closed  sets. 

Occasionally  it  is  convenient  to  define  closed  subsets  of  X(A) 

in  an  indirect  manner.  Let  A,   B  be  any  k- reduced  algebras,  and 

consider  the  tensor  product  A  3  B.  Let  S  be  any  subset  of  A  ■£>  B, 

For  any  s  =  Z  a.  8>  b.   in  S  and  any  x  in  X(A)  let  s(x)  be  the 
element  Z  a.(x)b.   of  B. 

(2.1+)   LEMMA.   V^S)  :=  {x  £  X(A)  |  s(x)  -  0  for  all  s   in  S}  is 
a  closed  subset  of  X(A). 

PROOF.   Let   {b,  ,  A  in  A]  be  a  basis  for  B  as  a  k- vector 
A 

space.  Each  s  can  be  expressed  in  the  form  Z  a^   55  b   (finite  sum), 

so  s(x)  -I  a,   (x)b  .   Since   {b,  }   is  linearly  independent, 
A,s     A  A 

p 
V  (S)  =V({ax    in  A,   s   in  S,   A  in  A}).  C 

We  now  define  an  operator  I  which  associates  a  subset  l(Z)   (in 
fact,  an  ideal)  of  A  to  each  subset  Z  of  X(A).   It  is  defined  as 

(2.5)   I(Z)  :=  {a  in  A  |  a(x)  =  0  for  all  x  in  Z}. 

Thus  l(Z)  is  the  annihilator  of  Z;  it  is  evidently  an  ideal  of  A. 
We  can  define  l(Z)  equivalently  via  homomorphisms,  which  gives 
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(2.6) 

Kz) 

:  =  f  i  {ker  x,  x  in  Z}. 

(2.7) 

PROPOSITION.   The  operator  I  has  the  following  properties: 

(a) 

I(X(A))  =  {0}. 

(b) 

z1  cz2^i(z?)  ci(zx). 

(c) 

Z  C  V(I(Z)),   S  C  I(V(S)). 

(a) 

I  (  U  {Z, ,  A  in  A})  =n{l(VA),  A  in  A}. 

(e) 

l(V(S))  is  the  intersection  of  all  the  k-ideals 

contai 

ling 

o  . 

(f) 

V(I(Z))  =  {x  in  X(A)  |  ker  x  D  Qz   ker  z}. 

PROOF,   (a),  (b),  (c),  and  (d)  are  easy  consequences  from  (2.5), 


(2.6). 


(e)  Observe  that,  by  (2.6), 
I(V(S))  =0  {ker  x,  x  in  V(S)J, 

and  by  (2.2), 

x  is  in  V(S)   iff  S  C  ker  x. 

(f )  Similar  to  (e) . 

(2.8)   REMARK.   Using  (2.3)  and  (2.7)  it  is  easy  to  verify  the  following 
facts: 

(i)  I(V(I(Z)))  =  I(Z), 

(ii)  V(I(V(Z)))  =  V(Z), 

(iii)   the  Zariski  closure  of  any  Z  C  X(A)   is  V(l(Z)),   and 

(iv)  an  ideal  I  is  of  the  form  l(z)   if  and  only  if 
I  =I(V(I)). 

Ideals  as  in  (iv)  are  called  closed;  this  terminology  comes  from 
regarding  S  «->l(v(S))  as  an  algebraic  (not  topological)  closure  operator. 


27 


One  could  have  deduced  (i)-(iv)  from  the  fact  that,  by  (2.3c), 
(2.7b)  and  (2.7c),  the  pair   fV,  I)  constitutes  a  duality  or  Galois 
connection  (KUROSH  [1965,  §  51]).  Finally,  it  also  follows  that   (V,  I] 
establish  an  inclusion-reversing  bijection  bet-ween  closed  ideals  of  A 
and  closed  subsets  of  X(A).  q 

(2.9)  LEMMA.   For  any  ideal  I  _of  A,   X(A/l)   can  be  naturally 
identified  with  V(l);  therefore  the  canonical  map  v.   A  ->A/l  can 
be  naturally  identified  with  the  restriction  map  a  >->a|v(l),  where 
a  is  viewed  as  a  function  on  X(A) .  Furthermore,   I  is  closed  if 
and  only  if  A/l  _is  k- reduced. 

PROOF.  From  elementary  algebraic  considerations,  the  injection 
X(A/l)  ->X(A):  y  ;->  y°7T  shows  that  there  is  a  bijection  between 
functions  y  in  X(A/l)  and  those  functions  x  =  y°7r  in  X(A)  for 
which  x|l  =  0,  that  is,  I  C  ker  x.  These  are  precisely  the  x  in 

v(i). 

To  prove  the  second  part,  note  that  k-ideals  of  A/l  correspond 
via  v    to  those  k-ideals  of  A  which  contain  I.  Thus  rai\   AA  =  ° 
precisely  when  I  is  the  intersection  of  all  the  k-ideals  containing 
it.  Applying  (2.7e)  to  (2.8iv)  gives  the  proof.  □ 

We  need  the  following  elementary  topological 

(2.10)  DEFINITION.  A   topological  space  Z  is  irreducible  iff  Z  is 
not  the  union  of  two  proper  closed  subsets,  in  other  words,  iff 

Z  =  Z  Uz„,   Z.   closed,  implies  Z  -   Z,   or  Z  -  Z_. 

1    2'        1 2 — —       1  —       2 

Clearly,   Z  is  irreducible  iff  any  two  nonempty  open  sets  of  Z 
have  a  nonempty  intersection,  i.e.  iff  any  open  subset  of  Z  is  dense. 
Therefore,  irreducibility  permits  the  use  of  local  intuition  and  methods 
(=  arguments  about  neighborhoods)  in  the  proof  of  global  statements. 

To  apply  the  above  concepts  in  our  context,  we  must  study  more 
closely  the  Zariski  topology  in  the  spaces  X(A).   This  topology  will 
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in  general  not  be  Hausdorff  (---  different  points  having  disjoint 
neighborhoods),  hut  it  is  true  that  each  point  of  X(A)   is  a  closed 
set.   Indeed,  by  (2.7'f),   {x}  =  {z  |  ker  z  D  ker  x};   since  ker  x  is 
a  maximal  ideal,  it  follows  that   fx)  =  [x]  as  wanted. 

A  set  Z  CX(A)  can  be  given  the  subspace  topology;  thus 
irreducibility  of  Z  is  well  defined.  In  particular,  it  is  easy  to 
see  that  a  closed  subset  V  of  X(A)   is  irreducible  iff 

(2.11)  VCVUV,   V.   both  closed  in  X(A),   implies  V  G  V,   or 
V  C  V  . 

Recall  that  an  ideal  F  of  A,   P/  A,   is  prime  iff,  for  any 

ideals  J,  J"2?  J..J„  C  P  implies  J,  C  P  or  J?CP.  Equivalently, 

P  is  prime  iff,  for  any  a,  b  in  A,   ab  in  P  implies  either  a 

is  in  P  or  b  is  in  P;   see  BOURBAKI  [1972,  II. 1.1].  We  then  have: 

(2.12)  LEMMA.   Let  V  be  a  closed  subset  of  X(A).   Then  V  is 
irreducible  if  and  only  if  l(V)  is  prime. 

PROOF.   ["only  if"]  Assume  J.J2Cl(v).   Then,  by  (2.3c), 

V  =  V(I(V))  C  V(J,)  UV(J  ).   Since  V  is  irreducible,   V  C  V^)   or 
VCV(J2).   Thus  by  (2.6c, b)  J  C  l(V(j) )  C  l(v)   or 

J2  CI(V(J2))  CI(V). 

["if"]  Assume  V  =  V  U  V,  V±     closed.  Then  by  (2. 7d) 
I(V  ).I(V2)  Cl(V)Oi(V2)  =  I  (V  Uv2)  =  I(V).   This  and 
I ( V)  =  prime  imply  l(V  )  C  l(V)  or  l(V0)ci(v).  Thus 

V  -  VT(V)  C  VI(V  )  =  V   or  VCVg. 

In  particular,  X(A)  is  irreducible  iff  A  is  an  integral  domain. 

It  follows  from  (2.12)  that  V  m  l(V)   and  I  •->  V(l)   establish  an 
inclusion-reversing  bisection  between  irreducible  subsets  of  X(A)   and 
prime  ideals  of  A . 
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3 .  k-Spaees . 

A     k-space  is  the  abstract  version  of  a  space  of  the  type  X(A): 

(3.1)  DEFINITION.  A  k-space  (X,  A(x)),   or  simply  X,   is  a  set 
X  and  a  k-algebra  A(X)  of  functions  X  -)  k  such  that  the  map 
X-»X(A(X)):  XM>e   (=  evaluation  at  x)  is  bijective.  The  elements 
of  A(X)  are  the  P^lyg^^aj^£HHJj:gi1_s_t_g£  x- 

The  terminology  "polynomial  functions"  is  motivated  by  Example  (l.k). 

We  always  consider  a  k-space  X  as  a  topological  space,  with 
topology  induced  from  the  Zariski  topology  on  X(A)  by  the  bisection 
X  -  X(A(X)).  We  shall  see  presently  that  all  k-spaces  are  essentially 
of  the  type  (X(A),  t(A)),  where  A  is  k- reduced  an  1     is  the  map 
introduced  in  (1.2). 

(3.2)  DEFINITION.  Let  X,  Y  be  k-spaces.  A  map  f:  X  -> Y  is 
polynomial  iff  for  each  polynomial  function  b:  Y  -*  k  in  A(Y)  the 
composition  bof:  X  -> k  is  a  polynomial  function  in  A(X). 

We  shall  use  X,  Y,  Z  to  indicate  k-spaces;   f:  X  -*Y  will 
always  mean  a  polynomial  map. 

It  can  be  trivially  verified  that  k-spaces  as  objects,  together 
with  polynomial  maps  as  morphisms,  constitute  a  category.  In  this 
category  we  have  the  following 

(3.3)  LEMMA.  Every  k-space  is  isomorphic  to  a  k-space  of  the 
type  (X(A),  i(A)),  where  A  is  k-reduced  and  1     is  the  map 
introduced  in  (1.2). 

SKETCH  OF  PROOF.  Given  a  k-space  (X,  A(X)),  let  A  :=  A(X). 
An  easy  check  shows  that  X  ->X(A):  x  »  ex  is  the  required  isomorphism 
of  k-spaces. 

Since  all  results  will  be  stated  up  to  isomorphism,,  we  are  justified 
by  (3.3)  when  carrying  out  our  proofs  about  k-spaces  only  for  those  of 
type  (X(A),  t(A)).  We  continue  to  use  our  convention  of  identifying 
t(A)  with  A  (so  A(X(A))  =  A)  when  there  Is  no  danger  of  confusion. 
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A  major  role  is  played  by  the  k- spaces  introduced  by  the  following 

(3.U)   DEFINITION.  A  k-space  X  is  a  variety  iff  A(X)  is  a 
finitely  generated  k- algebra . 

Let  a  ,  . ..,  a   generate  the  k-algebra  A.  Generation  is 

equivalent  to  ontoness  of  the  k-algebra  homomorphism 

u:  k[T_,  ...,  T  ]  -»A  defined  by  the  assignment  T.  (->  a. .  By  (l.k) 
in  11 

and  (2.9)  we  see  that  (X(A),  t(A))  is  isomorphic  to  (V,  B),  where 
V  is  the  set  of  points  x  in  k   which  satisfy  all  equations 
f(x)  =0,   f  in  ker  u,   and  where  the  functions  in  B  are  the 
restrictions  to  V  of  the  polynomial  functions  k  -»k.  There  are 
many  possible  representations  (V,  B)  for  each  A,   depending  upon 
the  choice  of  generators  a  ,  ...,   a  .  Thus,  (affine)  varieties  are  the 
coordinate-free  versions  of  those  subsets  of  k   defined  by  polynomial 
equations .  When  studying  varieties,  it  is  usually  simpler  to  deal  with 
representations  of  the  type  (V,  B).  Note  that  if  (V  ,  B  )  and 
(Vp>  B  )  are  (concrete)  varieties,  where  V  c  k   and  V    Ck,   a 
map  f :  V..  -»  V^  is  polynomial  precisely  when  f  is  defined  by  an 
m- vector  of  n- variable  polynomials . 

(3.5)   DIGRESSION.   Let  k  =  R,   the  reals.   Then  varieties  have  a 
natural  topology  induced  from  their  embedding  in  Euclidean  space  with 
the  usual  topology.   This  topology  is  finer  than  the  Zariski  topology; 
(for  instance  the  only  proper  Zariski-closed  subsets  of  R   are  finite 
sets).  More  generally,  given  any  normed  field  k,   we  may  define  a 
strong  topology  on  k-spaces  by  choosing  as  a  basis  of  open  sets  all 
finite  intersections  of  sets  of  the  type  f  (N),   for  all  polynomial 
functions  f  and  all  open  sets  N  in  k,   i.e.  the  coarsest  topology 
in  X  for  which  all  f  in  A(X)  become  continuous  for  the  normed 
topology  of  k.   (See  for  instance  SHAFEKEvTCH  [1975,  Chapter  ?]  for 
the  case  of  varieties  over  C. )  For  the  purpose  of  this  work, 
realization  theory,  we  are  mainly  interested  in  (global)  questions  of 
structure;  thus  we  shall  use  only  the  Zariski  topology,  even  in  the  cases 
k  =  R  or  C,   except  for  the  proof  of  some  technical  facts  on  varieties 
over  R  in  Section  k.  □ 
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(3-6)   DEFINITION.   Let  g:  X  ■->  XQ .   The  transpose  of  g  is  the 
homomorphism  A(g):  A(X0)   ->A.(X  )  defined  by 

A(g)(f)  :=  f°g. 

(3.7)   LEMMA.   Fix  two  k- spaces  X  ,  X  .   Tlien  the  assignment 

A:  g  nA(g)   establishes  a  bijection  between  polynomial  maps  X-L  -» X 

and  k- algebra  homomorphisms  A(X„)  ->A(X  ). 

We  shall  write  X(u):  X(B)  -»X(A)   for  the  polynomial  map 
corresponding  to  the  homomorphism  H:  A  ->  B. 


PROOF.  The  problem  is  to  define  an  inverse  X  of  the  transpose 
functor  A.  Let  the  k-spaces  X   be  (x(A . ),  t(A.)),   j  =  1,  2 

Y  .  u  Jo  . 

and  t.:  A.  -» k  "u  the  canonical  maps.  Let  a:  t  (A2)  ->  ^(A^  and 

take  x  in  X.  Since  e  :  ^(A  )  ~>  k  is  a  homomorphism, 

e  °a°  1  •  A„  -» k  is  also  a  homomorphism.  Now  define 
x    2   2 

X(a):  X(A  )  -*X(Ag):  a-^ex<=aH2. 

It  is  easy  to  verify  that  X(A(g))  =  g  and  A(X(a))  =  a  for  all 
g:  X(A  )  ->X(A2)  and  all  a:  t2(Ag)  -^^(A^. 

(3.8)   COROLLARY.   The  category  of  k-spaces  is  dual  (arrows  reversed) 
to  the  category  of  k- reduced  k- algebras . 

The  above  duality  allows  the  translation  of  constructions  and 
statements  about  algebras  into  (dual)  statements  about  k-spaces.  and 
vice  versa.  For  instance,  (1.8)  says  that  the  categorical  product 
XX  Y  of  two  k-spaces  X.  Y  is  the  k-space  X(A(X)®A(Y))   and 
that  the  underlying  set  of  this  k-space  is  the  cartesian  product 
X  X  Y.  By  induction,   X(A  )  X  X(Ag)  X  ...  X  X(An)  -  X(AX  S  ...  ®  Aq) . 
And,  in  particular,   (X(k[T]))   (n-th  fold  power)  coincides  with 
X(k[T  ,    ...,  T  ])  =  k  ;   see  Example  (l.U).   This  also  shows  that  the 
notation  k   is  consistent  with  products  in  the  category  of  k-spaces. 
(Note  that,  in  particular,   k  =  X(k)  =  one  point,  say   {0}). 


D 


□ 


on 
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As  an  example  of  the  transpose  construction,  consider  a  functi 
f:  X  -*k.   Since  the  transpose  A(f):  k[T]  -»A(X)  is  a  k-algebra 
homomorphism,,  A(f)  is  determined  by  A(f)(T)  =  T°f.  Since  T  is 
the  identity  map  on  k, 

(3-9)   A(f)(T)  =  f. 

So  the  transpose  of  f  is  the  map  P(t)  ^p(f). 

We  now  relate  various  properties  of  polynomial  maps  to  properties 
of  their  transposes. 

(3-1°)  DEFINITION.  A  polynomial  map  f:  X  -»T  ^dominating  iff 
f(X)  =  Y;  f  is  a  clos^d^_embedding  iff  f  can  be  factored  as  g  °gn, 
where  gg  is  an  isomorphism  X  *  y  and  g   is  the  inclusion  map 
V  -»Y,  for  some  closed  subset  V  of  Y. 

(3.11)  LEMMA.  Let  a-.   A  -» B  and  denote  f  :=  X(a):  X(B)  -»X(A). 
Then 

(a)  f_1(V(S))  =  V(a(s))  for  any  SCA. 

(b)  f  is  continuous. 

(c)  f(V(l))  =  V(a"  (I))  for  any  closed  ideal  I  of  B. 

(d)  f  is  dominating  if  and  only  if  a  is  one-to-one. 

(e)  f  is  a  closed  embedding  if  and  only  if  a     is  onto. 

PROOF,   (a)  x  is  in  f_1(v(s)) 

iff  f(x)  is  in  V(s), 
iff  a(f(x))  =  0  for  all  a  in  S, 
iff  a(a)(x)  =  0  for  all  a  in  S, 
iff  x  is  in  V(o;(s)). 

(b)  All  closed  sets  in  X(A)  are  by  definition  of  the 
form  V(S).  By  (a),  pre- images  of  closed  sets  are  closed. 

(c)  We  first  prove  that  a     (l)  =  l(f(v(l ) ) );  in  fact  the 
following  statements  are  equivalent: 


a  is  in  l(f(V(l))), 

a(f(x))  =  0  for  all  x  in  V(l), 

a(a)  is  in  l(V(l))  =1  (I  =  closed'.), 

a  belongs  to  a  (i). 


Therefore  V(a  (i))  =  V(l(f(V(l))))  =  f(v(l)),   as  required. 


-1, 


(d)  Applying  (c)  to  I  =  {0},   V(a"  (i))  =  f(X).   So  f  is 
dominating  iff  V(ker  a)  =  Y,  which  by  ( 1.11a)  is  equivalent  to 
ker  a  =  {0}. 

(e)  Dualizing  (3. 10),   f  is  a  closed  embedding  iff  the 
transpose  homomorphism  a     factors  as  3p°f3 n,  where  f3   is  a 
homomorphism  B  -» B/l  for  some  ideal  I  and  BQ  is  an  isomorphism. 
Such  factorizations  exist  precisely  when  a     is  onto. 

Both  dominating  maps  and  closed  embeddings  will  play  important 
roles  in  our  treatment  of  realization  theory.  We  emphasize  some 
intuitive  aspects  of  these  concepts  through  the  following 

(3.12)  DISCUSSION.  It  follows  from  (e)  above  that  a  onto  implies 
that  f  is  one-to-one.   The  converse  is  false.  For  an  easy  example, 
consider  X  =  Y  =  k  :=  R,   f,  (x)  :  =  x  .  Then,   f   is  one-to-one, 
but  A(f  ):  k[T]  -^k[T]:  T  h>  T      is  not  onto  (T  is  not  in  the  image) 

The  problem  does  not  lie  in  the  fact  that  R  is  not  algebraically 

~         2      ,  2   3N 
closed:   for  any  field  k  we  may  consider  f„:   k  ->  k  :  x  •->  (x  ,  x  j; 

f   is  one-to-one  but  A(f  ):  k[T  ,  Tg]  -»k[T]:  T  h>  T  ,  T£  ->  T5 

has  image  k[T  ,  T3]  ^  k[T]. 

A  variation  of  the  last  example  provides  a  bijective  map  f 

3 

which  is  not  an  isomorphism.  Indeed,  consider  the  "cusp" 

Y  :=  {(x,  y)  in  k2|x3  =  y  }. 
Then 

(3.13)  ty   k  -»Y:  x  ^(x£,  x3), 
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is  bijective.  But  f  is  not  an  isomorphism,  because,  by  the 
equivalence  of  categories  (1.7),   f,  is  an  isomorphism  iff  A(f  )   is 

o   3  3 

an  isomorphism.   But  A(Y)  ~  k[T  ,  T3]  i   kfT^;   see  DIEUDCOTE  [1974,  1.1, 
Example  5l •  Intuitively,  we  cannot  expect  to  have  any  isomorphism 
between  k  and  Y  because  the  curve  Y  has  a  singularity  (at  the 
origin)  while  the  line  k  has  none. 

It  is  not  difficult  to  prove  that,  in  the  category  of  k-spaces, 

monomorphism  =  one-to-one  and  epimorphism  =  dominating.  A  dominating 

map  is  in  general  not  onto.  This  is  illustrated  over  k  =  R  by 

2 
X  =  Y  :=  R,   f(x)  :=  x  ;  this  is  a  dominating  map  because  the 

smallest  Zariski  closed  set  containing  the  nonnegative  reals  is  all  of 

R_.  In  the  particular  case  when  k  is  algebraically  closed  and  X  is 

an  irreducible  variety,  a  dominating  f  becomes  almost  onto,  in  the 

sense  that  f(X)  contains  a  Zariski  open  subset  of  Y;   see  (3.IU) 

below.  Thus  in  this  particular  case  f(X)  is  all  of  Y  except  at 

most  for  a  subset  of  "lower  dimension"  (to  be  made  precise  later). 

Moreover,  it  can  be  proved  that,  when  k  -  C  and  Y  is  given  the 

strong  topology  (j>.h),      f(X)  has  a  nowhere  dense  complement. 

We  remarked  above  that  the  image  of  a  polynomial  map  is  in  general 
not  a  closed  set.  When  f:  X  -» Y  is  a  polynomial  map  between  two 
varieties,  one  can  sometimes  characterize  f(X)  as  a  set  defined  by 
polynomial  equalities  and  inequalities.  A  constructible  subset  C  of 
a  variety  X  is  a  finite  union  of  sets  of  the  type  uHf,   where  U 
is  open  and  F  is  closed.   In  other  words,   C  is  in  the  Boolean 
algebra  generated  by  the  Zariski  topology  of  X.  When  k  is  a  real- 
closed  field  (JACOBSON  [1964,  VI.2]),  like  k  =   R,   we  define  an 
real-constructible  set  C  as  a  finite  union  of  sets  ?Of,  where  F 
is  closed  and  U  is  of  the  type   [x  in  x|f(x)  <  0}  for  some 
polynomial  function  f. 

(3.14)  THEOREM.   Let  X,  Y  be  varieties  and  let  f:  X  ~>Y.   Then 

(a)  If  k  is  algebraically  closed  and  C  is  a  constructible 
subset  of  X  (e.g.  C  =  X),  then  f(x)  is  a  constructible  subset  of  Y. 
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(b)  If  k  is  a  real-closed  field  (e.g.   k  =  R)   and  C   is  a 
real-constructible  subset  of  X   (e.g.   C  =  X),   then  f(x)   is  a 
real-constructible  subset  of  Y. 

(c)  If  f  is  dominating,   Y  is  irreducible,  and  k  is 
algebraically  closed,   f(X)   contains  a  (Zariski)  open  subset  of  Y. 

(d)  If  f  is  dominating,   Y  is  irreducible  and  k  =  g,   then 
f(X)  contains  a  set  open  in  the  strong  topology  of  Y. 

PROOF.   (a)   This  is  the  well-known  CHEVALLEY's  theorem;  see 
for  instance  DIEUDONNE  [19?U,  Chapter  k,   Corollary  to  Proposition  lk]* 

(b)  This  statement  is  essentially  the  generalized  STURM1  s 
Theorem  due  to  TARSKE  and  SEIDENBERG;  see  JACOBSON  [196k,   VI. 10].   (The 
usual  statement  of  the  TARSKE-SEIDENBERG  result  requires  that  the 
coefficients  of  f  be  rational,  so  that  f(C)  can  be  constructed 
algorithmically.  However,  the  proof  itself  does  not  depend  upon  this 
requirement;  see  SEIDENBERG  [195J+,  footnote  in  page  366].) 

To  prove  (c)  and  (d),  write  f(X)  =  fi5^te  Ui  ^ Fi  as  in  the 

definition  of  construct ibles  and  real-const ructibles,  with  the  F. 

1 

closed  and  the  U.   Zariski- open  or,  when  k  =  g,   of  the  type 
{f(x)  <  0),   so  open  in  the  strong  topology.  If  F.  ^  Y  for  all  i, 
then,  by  irreducibility  of  Y,   f(x)  = U  F.  ^  Y,   contradicting 
domination  of  f.  So  some  F.  =  Y,   and  f(X)  contains  U..  C 

An  important  type  of  dominating  map  arises  in  the  following 

(3.I5)  DEFINITION.   The  principal  open  set  defined  by  a  £  A  is 

D(a)  :=  {x  in  X(A)  |  a(x)  ^  0). 

The  principal  open  sets  constitute  a  basis  for  the  Zariski  topology. 
Indeed,  the  complement  of  any  closed  set  V(s)   is  the  union  of  the 
D(a),   a  in  S.  For  simplicity,  let  A  be  an  integral  domain.  Denote 
by  a"  A  the  algebra  A  C  a"  A  C  Q(A)   consisting  of  all  b/a  ,   b  in 
A,  n  >  0.  Take  any  a:   a"  A  ->k.  Then  p  :=  a\k:   A  ->  k  satisfies 
e(a)  £   0.  Conversely,  if  p:  A  -> k  and  p(a)  4   °  then  the  rule 
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a(b/a  )  :=  p(b)/p(a)   defines  the  (unique)  a     extending  p.  Therefore 
D(a)   is  the  image  of  the  map 


(3-16)  X(a_1A)  -*X(A), 


dual  to  the  inclusion  A  C  a  A.  This  map  is  both  one-to-one  and 
dominating  and  establishes  a  homeomorphism  between  X(a  A)  and  D(a); 
see  BOURBAKI  [1972,  II. k. 3,  Corollary  to  Proposition  13] .   Local 
arguments  about  k- spaces  are  often  simplified  by  restricting  attention 
to  principal  open  sets.  □ 

We  shall  be  especially  interested  in  "quotients"  of  varieties: 

(3.17)  DEFINITION.  A  k- space  X  is  an  almost-variety  iff  there 
exists  a  variety  X  and  a  dominating  polynomial  map  f :  X  -» X. 

By  definition  of  "dominating",  this  means  that  X  has  a  dense 

subset  f(X)  consisting  of  equivalence  classes  of  elements  of  X.   Let 

f:  X  -» X  as  above.  By  Hilbert's  basis  theorem  we  may  write 

X  =  X,U...Ux  ,  where  the  X.   are  irreducible  closed  sets;  see 
1         r'  1 

BOURBAKE  [1972,  III. 2. 10,  Corollary  3  of  Theorem  3].  From  the 

definition  of  irreducibility,  it  is  easy  to  prove  that  continuous 

images  and  closure  of  irreducible  sets  are  irreducible.  Hence 

X.  :=  f (X. )  is  an  irreducible  closed  subset  for  each  i.  Since 
iA   ^  1 

fix.:  X.  -> X.   is  dominating  and  each  X.   is  a  variety,  X  can  be 
'111  1 

written  as  a  finite  union  of  irreducible  almost -varieties. 

The  next  lemma  shows  that  every  irreducible  almost-variety  has  an 
open  (hence  dense)  subset  which  is  a  variety,  justifying  the  terminology 
"almost  variety" . 

(3.18)  LEMMA.   Let  X  be  an  irreducible  almost-variety.   Then  there 
exists  a  principal  open  set  D(a)  of  X  which  is  a  variety,  i.e.  such 
that  a  A(X)  is  finitely  generated. 

PROOF.   Let  f:  X  ->  X  be  dominating  with  X  an  irreducible 
variety.  Then  A(f):  A(X)  -»A(X)  embedds  A(x)  in  B  :=A(x);  we 
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identify  A(X)  with  its  image  A  C  B.  By  BOURBAKT  [1972,  V-3.1., 
Corollary  1]  there  is  an  a  in  A  and  T  ,  . . . ,  T   in  B  such 
that  the  finitely  generated  algebra  a  a     is  integral  over  the 
polynomial  ring  a_1A[T  ,  . ..,  T  ].  By  (1.13)  a_1A[T  ,  . . . ,  T  1  is 

a  finitely  generated  algebra.   Let  \(\t    ••«>  T  )>    •••>    p  (T,,  ...,  T  ) 
generate  a^AtT^  ...,  Tp]  .  Then  P^O,  ...,  0),  ...,  P^O,  ...,  0) 

generate  a  A.  Thus  a  A  is  finitely  generated  I 

We  shall  find  in  Chapter  IV  that  the  canonical  state-spaces  are  in 
general  almost- varieties.  In  particular  we  shall  give  an  input /output 
map  whose  canonical  state-space  is  that  given  by  the  following 

(3.19)  EXAMPLE.  Let  A  :»  k[T  "ij,   n  >  0] .  Since  ACkfT^  T£], 
X(A)  is  an  almost- variety.  Take  a  :=  T...  Then  a"  A  =  kfT.,  Tg,  T~  ] 
is  a  finitely  generated  algebra.  As  a  variety,  X(a  A)  can  be 

•z. 

represented  by  the  set  of  solutions   (x,  y,  z)  in  Ir   of  the  equation 
xz  -  1  =  0.  Note  that  X(a~  A)  can  be  also  naturally  viewed  as  the 

Q 

principal  open  set  x1  /  0  in  k  .  A  point  x  of  X(A)  not  in  D(a) 
satisfies  x(T. )  =  T ,(x)  =  0  and  (as  we  now  show)  it  also  satisfies 
xCT.Tp)  =  0  for  all  n.  This  is  not  trivial,  since  the  "proof" 
x(T,T  )  =  x(T  ).x(T  )  =  0.x(T  )  =  0  is  fallacious:  x(Tp)  is  not 
defined,  because  Tp  is  not  in  A.  One  way  to  prove  the  statement  is 
by  means  of  the  theorem  of  extension  of  places  (see  BOURBAKI  [1972, 
VI. 2. k,   Proposition  13]).  This  theorem  implies  that,  if  there  is  no 
extension  of  x  such  that  x(T?)  is  defined,  then  there  does  exist  an 
overring  of  A  containing  T"   and  such  that  x(Tp  )  =  0  (the  values 
of  the  extension  of  x,  though,  are  not  necessarily  in  k).  But  if 
this  is  the  case,  then  x^t")  =  x^T^MT^  )  =  0  for  all  n,  as 
wanted.  Therefore  the  complement  of  D(a)  consists  of  just  one  point 
and  X(A)  is  the  disjoint  union  of  the  variety  x(a  A)  and  this  one 
extra  point. 


k.     Dimension. 

There  are  various  possible  notions  of  dimension  for  a  k-space  X. 
All  these  notions  coincide  if  X  is  a  variety  and  k  is  algebraically 


closed,  but  this  is  not  true  for  more  general  X  or  k.  Thus  our  choice 
will  he  to  some  extent  arbitrary,  to  be  justified  by  the  results. 

Let  A  be  an  overring  of  B  having  no  zero  divisors.  Recall  that 

elements  &.,    ...,   !>       of  A  are  algebraically  dependent  over  B  iff 

there  exists  a  nonzero  polynomial  P  in  B[T  ,  ...,  T  ]   such  that 

P(in,  ...,  Z    )   =  0.  When  no  such  P  exists,  in,  ....   &       are 
1'    '  n  '   1     '      n 

algebraically  independent .  We  review  some  elementary  properties  of 
algebraic  dependence.  They  can  be  found,  for  instance,  in  ZARISKI  and 
SAMUEL  [1958,  11.12]. 

An  arbitrary  subset  L  of  A  is  algebraically  independent  iff 
every  finite  subset  of  L  is.  A  transcendence  basis  for  A  (over  B) 
is  a  maximal  algebraically  independent  set  LCA;  in  other  words,  if 
s  is  not  in  L,  then  lU{s)  is  algebraically  dependent.  All 
transcendence  bases  have  the  same  cardinality,  the  transcendence  degree 
trdeg^  A  of  A  over  B.  When  B  =  k,  we  denote  trdeg  A  just  by 
trdeg  A.  When  trdeg^  A  =  0,  A  is  algebraic  over  B. 

The  notion  of  transcendence  degree,  which  is  based  on  "dependence", 
is  analogous  to  that  of  dimension  of  vector  spaces . 

A  transcendence  basis  for  A  over  B  can  be  extracted  out  of  any 
system  of  generators  of  A  over  B.  In  particular,  when  A  is  a 
finitely  generated  B-algebra,  trdeg^  A  is  finite.   If  both  A,  B  are 
k- algebras ,  then 

(U.l)   trdeg  A  =  trdeg  B  +  trdegg  A. 

(U.2)   LEMMA.  (ZARISKI  and  SAMUEL  [1958,  II. 2,  Theorems  28  and  29]). 

Let  A,  B  be  integral  domains,  and  let  cp:  A  ->  B  be  onto.  Then 

trdeg  B  <  trdeg  A;  if  both  are  finite  then  equality  can  only  hold  when 

cp  is  an  isomorphism.  □ 

When  the  k-algebra  A  is  not  an  integral  domain,  one  can  still 
define  trdeg  A,  making  use  of  the  integral  domains   {A/P,   P  prime 
ideal  of  A};  just  let 
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(U.3)   trdeg  A  :=  sup  [trdeg  A/P,  P  prime  ideal  of  A). 

By  (4.2),  this  definition  is  consistent  with  the  basic  definition  for 
integral  domains. 

We  define  a  notion  of  dimension  for  a  k- space  by  setting 

(k.k)       dim  X  :=  trdeg  A(X). 

Observe  that  an  almost- variety  X  is  always  finite-dimensional,  since 
then  A(x)  is  included  in  a  finitely- generated  algebra. 

A  combinatorial  consequence  of  the  definition  of  dimension  is: 
(k.5)       LEMMA.   Let  dim  X  =  n  be  finite.  Let 

be  a  chain  of  irreducible  closed  subsets  of  X.  Then  s  <  n. 

PROOF.   Clear  by  (2.12)  and  induction  on  (k.2). 

Let  f:  X  -h>  Y  be  a  polynomial  map.  Since  f  is  continuous  and 
points  of  Y  are  closed,  the  fibers  f  (y),  y  in  Y,  are  closed 
subsets  of  X.  Therefore  each  fiber  is  a  k-space  and  as  such  has  a 
well-defined  dimension. 

We  shall  see  below  how  to  generalize  the  dimension  formulas  for 
linear  maps  (dim  X  -  dim  ker  f  =  dim  f(X))  to  the  polynomial  context. 
This  will  follow  from  a  general  result  on  the  structure  of  polynomial 
maps. 

The  next  theorem  summarizes  a  number  of  relevant  results  in  a  form 
convenient  for  our  purposes . 

(k.6)       THEOREM.  Let  X,  Y  be  two  irreducible  almost- varieties,  with 
dim  X  =  n,   dim  Y  -  m.  Let  f:  X  -»Y  be  a  dominating  polynomial  map. 
Then  there  exists  an  integer  s  >  0,   irreducible  varieties  X  ,  Y   and 
polynomial  maps  j'  :  X  -» X,   j  :  Y  ->  Y,   f  :  X  -» Y   and 
g:  X  -»Y1  x  kn-m  such  that 
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(a)  j   [ respectively  j  ]   identifies  X   [ respectively  Y  ] 
with  a  principal  open  set  of  X  [ respectively  Y]  as  in  (3,16). 

(b)  The  following  diagram  commutes: 


1 


where  pr..   denotes  the  projection  in  the  first  factor. 


(c)  g  is  dominating  and  the  sets  g  (z)  have  cardinality  at 


most  s  for  each  z  ui  Y  X  k 


(d)  If  k  is  algebraically  closed,  then  g  is  onto  and  each 
g  (z)  has  exactly  s  elements . 


(e)  If  k  =  H  then  X,,  Y   are  differentiable  manifolds  and 
the  differentials  dg  and  df1  have  full  rank  at  every  point. 

(f)  If  either  k  =  R  or  k  is  algebraically  closed, 
dim  f'^f-^x))  =  n  -  m, 

for  each  x  in  X.. . 

PROOF.  Let  A  :=  A(Y)  be  identified  through  A(f)  to  a 
subalgebra  of  B  :=  A(x).  By  (3.18),  there  exist  a  in  A  and  b  in 
B  such  that  a  A  and  b  "B  are  finitely  generated.  Then 
a  A  C  a  1  C  a" Vl.  By  BOURBAKI  [1972,  V.l.5],  there  is  some  s  in 
a"  A  such  that  s~  (a  A)  is  integrally  closed  in  its  quotient  field. 
Replacing  if  necessary  a  by  sa,  we  may  assume  that  a  A  is 
integrally  closed.  By  BOURBAKI  [1972,  V.3.1,  Corollary  1],  there  exist 
T,  ...,  T     algebraically  independent  elements  of  (ab)  B  such  that 


is 


(ab)  B  is  integral  over  A'  :=  a"  A[T,,  ...,  Tn_m] •  Since  a"  A 
integrally  closed,  A'   is  also  integrally  closed;  see  BOURBAKI  [1972, 
V.1.3,  Corollary  2]. 
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Let  s  be  the  separable  degree  of  the  algebraic  field  extension 
Q(B):Q(A').  By  BOUKBAKI  [1972,  V.2.3,  Remark  3],  the  fibers  of  the 
canonical  map  X((ab)_1B)  -> X(A)  have  at  most  s  elements.  Let 
X  :=  X((ab)_1B),  Y1  :=  X(a_1A)  and  let  fy   jy,  f^  g  be  respectively 
the  maps  dual  to  the  inclusions  B  C  (ab)""S,  A  C  a_1A,  a  A  C  (ab)  IB, 
A*  C  (ab)-:LB.  Then  (a),  (b),  and  (c)  hold  by  construction. 

If  k  is  algebraically  closed,  there  exists  an  open  set  V  in 
Y  X  kn~m  such  that  g_1(z)  has  precisely  s  elements  for  each  z  in 
V;  see  DIETJDONNE  [191k,   Chapter  5,  Proposition  6].   Let  UCg  (V)  be 
a  principal  open  set.  By  (5. 1*0,   ^(U)  contains  a  principal  open  set 
W.  Let  U  C  f_1(W)  Ou  be  a  principal  open  set.  Then  g(U)  C  V  implies 
that,  for  any  z  in  Y±  x  kn"m,   either  g_1(z)nu  is  empty  or  it  has 
exactly  s  elements.  Replacing  X]_  by  U  and  Tx  by  W  gives  (d); 
g  is  surjective  by  DIEUDONNE  [1972,  IV. k,   Corollary  1] . 

Let  k  =  R.  We  replace  X^,  Y   by  nonsingular  principal  open 
subsets;  see,  for  example,  BROCKER  [1975,  Theorem  12.12].  We  may  further 
replace  X   by  a  principal  open  set  in  which  dg  has  maximal  rank  p. 
By  (3.1U),   g(X  )  contains  a  strong  open  set  of  Y±  X  k   ,   so  by 
SARDS'  theorem  (see  BROCKER  [1975,  Theorem  2.11]),  p  =  n.  Similarly 
with  f  .  Thus  (e)  follows. 

(f)  For  k  algebraically  closed,  see  DIEUDONNE  [  197U,  Chapter 
k,   Theorem  2];  for  k  =  R  this  follows  from  (e)  and  BROCKER  [1975, 
Theorem  1.9l • 


CHAPTER  III.   REALIZATION  THEORY 

We  investigate  in  this  chapter  the  general  realization  theory  of  an 
m- input,  p-output  polynomial  response.  Before  doing  this,  we  develop 
the  formalism  of  Volterra  series  and  prove  some  simple  facts  to  be  used 
later  in  the  study  of  finiteness  conditions. 

Throughout  this  work,  k  denotes  an  infinite  field;  all  parameters 
belong  to  k.  The  assumption  that  k  is  infinite  is  merely  a  technical 
convenience,  permitting  the  identification  of  polynomials  and  polynomial 
functions . 

Both  m  and  p  (number  of  input  and  output  channels,  respectively) 
are  positive  integers,  arbitrary  but  fixed  throughout  the  discussion. 

5.  Volterra  Series. 

We  shall  use  the  following  notations: 

N  :=  set  of  nonnegative  integers. 

N311  :=  set  of  column  m- vectors  over  N. 

1[    :=  set  of  m  X  t  matrices  over  N_. 

(if1)   :=  set  of  all  finite  sequences  of  elements  of  g, 
including  the  empty  sequence  denoted  by  A. 

If  a    is  in  (jt)* ,     a  =  a...  .0^  4  A    "then  |a|  :=   length  of 
a  :=  t;  ||a||  :=  weight  of  ar  :«  0^  +  ...  +  Q^;   |a|  =  ||a||  :=  0; 
a.  .  :  =  element  in  i-th  row  of  column  vector  a.,   j  =  1,  ...,  t, 
i  =  i,  ...,  m. 

A  :=  set  of  proper  sequences:  cc     in  (Nj   belongs  to  A  iff 
a  =  A  or  a  =  a. . .  .a   with  a  4-  °- 

If  a   =  a.... a,   and  p  =  P-...P   are  in  (N™)  ,  ap  :=   concate- 

X     "t  X     s 

nation  of  a    and  p  :=  CL. .  •&^>1-  •  «Pg. 

If,  say,  t  <  s,  then  a  +   p  :=  sum  of  a    and  p  :=  ?'1--«7g, 
where  7.  :=  a.  +  p.   for  i  =  1,  ...,  t  (addition  of  a ,  p.   is  row- 
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wise  in  N^)  and  7.  :=  P.   if  i  =  t  +  1,  ...,  s.  Similarly  if  t  >  s. 

k2 
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Since  an  a  =  CL...O.   in  (if1)   is  a  sequence  of  columns,  we  may- 
regard  a,     as  an  m  X  t  matrix.  Thus  we  may,  and  shall,  make  the 
following  identification: 


iff  -  tWo  f*- 


t>0 

Under  this  identification,  concatenation  of  a  and  p  is  the  same 
thing  as  formation  of  the  block  matrix  [cdp];  addition  is  simply 
addition  of  matrices  (augmented  by  zeroes  to  the  right  if  necessary). 
Observe  that  the  notation  a.  .  is  consistent  with  the  matrix 
interpretation. 

Each  of  the  operations,  concatenation  and  addition,  make  (g  ) 
into  a  monoid;  in  both  cases  A  is  the  identity.  In  both  cases  A  is 
a  submonoid,  i.e.  if  a     and  p  are  both  in  A,   then  both  ap  and 
a  +  p  are  in  A.  We  shall  denote  by  (A,  • )  and  (A,  +)  the  two 
monoids  thus  obtained.  Both  monoids  will  play  a  central  role  in  our 
theory.  The  monoid  (A,  +)  is  used  in  defining  "polynomial"  and  (A,  •) 
is  used  in  obtaining  finiteness  conditions. 

Let  |  .,   i  =  1,  ...,  m,   j  =  1,  2,  3,  ...  denote  denumerably 
many  (distinct)  indeterminates,  and  for  each  j  let  |.  denote  the 
subset  |  ,  ...,  lmJ-  Let  a  -  0^.  •  .at  be  in  (ff     and  define 

a   ._  <*l   at 
i  .-  ^  ...it  , 

cc  • 
where  each  |  .<>     is  itself  a  monomial 

with  I?.  =  1  for  any  i,  j .  We  interpret  |   as  1.  A  formal  power 

ij  OL 

series  t  is  an  infinite  formal  combination  of  the  monomials  |   with 

coefficients  in  k.  Since  f   =  l"0  =  1°°°  =  . . .  ,   care  must  be  taken 

not  to  count  each  |°  more  than  once.  This  is  the  reason  for 

introducing  A.  Thus,  a  formal  power  series  in  the  £_  with 

coefficients  in  k  is  a  map 


kk 


t:  A  -»  k:   anil/  , 


denoted  also  as 


(5.1)      *(!,,   lo,    ...   )  =      2  A^|a,     * 


ainA  a  a 


in     k. 


The  set  of  formal  power  series  can  be  made  into  a  k-vector  space 
by  defining  term-wise  the  addition  of  two  power  series  ty,  $  and  the 
multiplication  by  scalars  r  in  k: 

(t  +  tflOa  :=  fa  +  r$a    for  all  a. 

Using  the  structure  of  the  monoid  (A,  +  ),  we  may  also  induce  a 
convolution  product  among  series,  which  extends  the  multiplication  of 
monomials  defined  as  £.£:=!   •  This  extension  is  well-defined 
because  (A,  +)  is  locally  finite,  i.e.  each  a  in  A  can  be  split 
in  only  finitely  many  ways  as  a  +  a,  a.      in  A.  The  convolution  of 
the  power  series  f  ,  ^p  is  then  defined  globally  by  the  formula: 

(5.2)   (W)  •=  L   *JL  for  all  a     in  A. 


The  set  of  all  formal  power  series  forms  a  k-algebra  when  endowed 
with  the  operations  of  scalar  product,  sum,  and  convolution  product.  In 
fact,  it  is  easy  to  prove  that  this  algebra  has  no  zero  divisors. 

We  intend  to  derive  response  maps  by  evaluating  power  series  for 
particular  values  of  the  i. ..  Thus  we  want  to  restrict  our  attention 
to  a  suitable  class  of  series  so  that  evaluation  at  arbitrary  input 
values  is  welldefined.  In  accordance  with  related  investigations  in  the 
literature,  we  shall  call  these  Volterra  series. 

Let  f  be  a  formal  power  series.  For  each  |.  .,  i|f  may  be 
rearranged  into  a  power  series  in  | .  .  whose  coefficients  are  power 
series  in  the  other  variables.  We  call  \|r  a  (formal)  Volterra  series 
(over  k)  iff,  after  each  such  rearrangement,  i|r  becomes  a  polynomial 
in  | .  . . 
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In  other  words,  t  is  a  Volterra  series  precisely  when  there  exist 

integers  d.   such  that  any  g.  .  appearing  in  \|f  has  exponent  <  d.  .. 

The  smallest  bound  d. .  for  the  exponents  of  g. .  is  the  degree 

id  ij       — B 

deg   ty  of  t  in  g.  .   (if  g.  .  does  not  appear  in  i,     deg.  .  f  :=  -«>) 

*«  ij       ij  ij 

Thus  i|c  is  a  Volterra  series  if  and  only  if  deg.  .  t  <  °°  for  all  i,  j. 

For  example, 

2     2  U     2  1;  8  2     pn 

*i :=  hi +  >n4 +  44si5  +  ••• +  >n-4 +  ••• 

is  a  Volterra  series,  and  deg  .  t,  =  2J  (if  m  >  1  then  also 

XJ   -*■ 


i|r  =  -  oo  for  i  =  2,  . ..,  m); 
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^o  :=  6n  +  8,«  +  6?o  +  •••  +  li.  +  •••  +  £  +  ••• 


is  also  a  Volterra  series,  with  deg..  .  i|r_  =  j;  but 

v3    11   11   511       511    ' 

is  not  a  Volterra  series. 

Since  finiteness  of  the  deg.  .  is  preserved  under  the  algebra 
operations,  the  set  of  all  Volterra  series  is  a  k-algebra.  This 
k-algebra  is  an  integral  domain,  since  the  algebra  of  all  power  series 
does  not  have  any  zero  divisors. 

(5-3)   NOTATION.  I  ,   or  just  t,      is  the  k-algebra  of  (formal) 
Volterra  series  over  k. 

(5.4)   DEFINITION.  The  degree  of  a  Volterra  series  \|r  is 

deg  t  :=  sup.  .  {deg.  .  \|r)  <  °°- 

Thus     deg  \|f-i    =  <leg  ^P  =  °°     for  the  above  examples  while,   on  the 
other  hand, 


deg  (a^      +  a^e       +  ...  +  afi       +  ...    )  =  1. 


1*6 


A  column  p-vector  of  Volterra  series  can  be  obviously  regarded  as 
a  power  series  in  the  g. .  with  coefficients  in  k ,  via  the 
identification 


*  = 


The  definition  of  degree  can  be  obviously  generalized  to  the  vector  case 

via  deg..  \|r  :=  max  {deg.  .  llr1',  ...,  deg.  .  i|AP'}.  We  let  ¥P  denote 
j-j  ij  ij 

the  set  of  all  vector  Volterra  series. 

Volterra  series  with  deg  $  <  °°     will  be  studied  in  detail  in 
Chapter  V.  An  important  tool  in  that  study  will  be  the  concept  of 
exponent  series,  which  we  now  introduce. 

The  concept  of  time-shift  is  incorporated  into  the  context  of 
Volterra  series  through  a  product  of  Volterra  series  which  is  based  upon 
the  monoid  (A,  •).  We  denote  by  \|r.\|r  the  Volterra  series  defined  by 

(5-5)   (*•+)„  :=  Jin  +p+r  for  all  a     in  A. 

Note  that  \|r.t  is  not  the  same  as  the  convolution  product  \|n|r  defined 
by  (5*2).  A  change  of  notation  is  useful  at  this  stage.  Instead  of 
writing  i|r  =  Z  fj  ,  we  shall  use  the  notation 

(5.6)  Z  1IM2, 

and  call  the  expression  (5.6)  the  exponent  series  cp  associated  to  \|r. 
Thus  cp  is  just  a  different  notation  for  the  same  mathematical  object  \|/. 
If  cp,  cp  are  associated  to  \|r,  i,  we  denote  cpcp  :=  \|r.\jr.  With  these 
notations,  the  product  (5.5)  can  now  be  expressed  simply  as  a  linear 
extension  of  the  multiplication  among  indeterminates : 

(5.7)  cpcp  =  (Z  cpca)(Z  app)  =  Z  Z  9a$pap. 


hi 


Exponent  series  provide  a  new  way  of  expressing  the  condition 
"deg  \|r  <«•".  For  this,  let 

(5.8)  supp  i|r  =  supp  cp  :=  {a     in     A  |  \|r     4  0), 

be  the  support  of  t  (or  of  its  associated  exponent  series). 

(5.9)  DEFINITION.  The  support  of  \|r  is  finitely  generated  iff 
there  exists  a  finite  subset  J  =  J ,   of  Nj   such  that 

(5.10)  supp  fcA  :=  J  Ha, 

i.e.,  each  column  a.     of  a  =  a   . .  .a      is  in  J,  for  each  a    jLn  supp  i|r. 
3         it 

In  terms  of  the  exponent  series  cp  associated  to  ty,  (5-9)  means  that 
cp  is  a  power  series  in  the  finitely  many  (noncommuting)  variables  in  J. 
Since  for  any  integer  d  there  are  only  finitely  many  vectors  in  W 
with  all  entries  <  d,  we  have  the  following  trivial 

(5.11)  LEMMA,  deg  t  <  «  if  and  only  if  supp  \|f  is  finitely  generated. 

The  fact  that  deg  \|r  <  °°  is  equivalent  to  cp  being  a  series  in 
finitely  many  variables  will  be  exploited  in  Chapter  V. 

We  now  return  to  our  study  of  arbitrary  Volterra  series.  Their 
introduction  was  motivated  by  the  need  of  evaluating  series  at  arbitrary 
input  values.  We  now  study  these  evaluations.  In  fact,  we  study  a  more 
general  type  of  operation  on  Volterra  series. 

Let  K  be  an  overring  of  k,  and  suppose  given  an  infinite  family 
r  =  {r. .,  i  =  1,  ...,  m,  j  =  1,  2,  . . .  )  of  elements  of  K.  As  before, 
we  introduce  the  shorthand  notation  r.  =  r  ,  ...,  r    for  each  j,  and, 
for  each  a  =  ex.... a.   in  A, 


ra   ._  ral   r°t  •=  raH  ,r°mt 
r  .-  r±   ...rt  .  r^   .r^  , 

(products  in  the  ring  K). 
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The  definition  of  a  Volterra  series  t  as  a  power  series  with  all 
deg.  .  t  <  °°  is  clearly  equivalent  to  the  requirement  that  f  be  a 
polynomial  when  expressed  as  a  series  in  each  finite  subset  of  variables 
|  ,  ...,  |  ,   for  any  fixed  t.  Thus  we  may  write 

(5.12)  1r(|   £   ...  )  =   Z   f  (L+1,  ...  )ia, 
l      d  ain  A  a  X+L 

|a|<t 

and  £   =0  for  all  but  finitely  many  a.  Let  us  make  the  substitutions 
|..Hr..  in  (5.12),  for  i  =  1,  . ..,  m  and  j  =1,  ...,  t,  performing 
the  resulting  products  between  the  r.  ..  We  obtain 

(5.1?)  Hrv    ...,  rt,  it+1,  ...  )  =   S  A  ^(lt+1,  ...  )ra. 

TaT^t 

Since,  as  we  already  remarked,  the  sum  in  (5-13)  is  finite,  a  further 
evaluation  |.  .  t->  0,   i  =  1,  ...,  m,   j  >  t,   results  in  a  finite  linear 
combination 

(5.14)  t(rx,  ...,  rt,  0,  ...  )  =Z^(0,  ...  )ra,  ^(0,  ...  )  in  k. 

When  K  =  k,   (5.IU)  is  then  in  k,  the  result  of  substituting  a  finite 
"input"  sequence  into  the  Volterra  series.  In  general  we  obtain  an 
element  of  K,  and  the  assignment 

(5.15)  *  -*K:  +  ^(r^  ...,  rt,  0,  ...  ), 

is  clearly  a  k-algebra  homomorphism. 

We  may  instead  apply  to  (5.I3)  the  further  substitutions 
L,  h»|     ,  i  =  1,  ...,  m,  j  >  t,  to  obtain 

(5.16)  tCr^  ...,  rt,  |x,  l2,  ...  )  =£^(1^  •••  )ra. 

Since  (5.16)  is  a  finite  K-combination  of  Volterra  series  over  kCK, 
we  may  regard  (5.16)  as  a  Volterra  series  with  coefficients  in  K;  this 
justifies  the  notation  t(r.,    ...,  OCS^  l2'    ' "    ^     °r  ^Ust 


h9 


(5.17)  t(rr    ...,   rt), 

instead  of     \|f(r.,    . ..,   r  ,    £   ,    ...    ).     An  alternative  is  to  view 
$(t.,    ...,    r  )     as  an  element  of     ¥ <&K.     The  assignment 

(5.18)  T ->Y®kK:  iff  H>t(rr    ...,  rt), 

is  clearly  a  k-algebra  homomorphism.  Moreover,  (5.18)  is  an 
isomorphism  when  the  r. .  are  algebraically  independent  over  1,      since 
it  just  amounts  to  a  relabeling  of  variables. 

In  order  to  state  a  technical  lemma  to  be  used  later,  we  need  the 
following 

(5.19)  NOTATION.  Let  t  >  0  be  an  integer.  Then- 

is  the  homomorphism  f  wt(L,  ...,  %,,    0,  ...  ). 

Since  k[  £..,  ...,  £.]  is  an  integral  domain,  ker  e   is  a  prime  ideal.  Also, 

(5.20)  H  ker  e.  =  {0}. 
t>0     t 

Let  s  <  t.  Then  there  exists  an  onto  homomorphism 

S,t:  k[*l'  ••"  *t]  -«*!'  — '  h]> 

obtained  by  setting  |.  .  =  0  for  i  =  1,  . ..,  m  and  j  =  s  +  1,  ...,  t. 

■*■  J 
By  definition  of  e  , 


(5-21)  es,t°et 


e  . 
s 


Let  B  be  a  k-subalgebra  of  ¥  and  write  B  :=  e.(B).  Then 
R.  :=  ker  €  |b  =  BOker  e   is  a  prime  ideal  of  B  and 
O  R.  cOker  e,  =  {0).  We  prove  a  technical 


(5 


,22)     LEMMA,     trdeg  B  =  sup   {trdeg  B, ). 
t>0  * 
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PROOF.  By  (k.2),  trdeg  B  >  sup  {trdeg  B  ).  Thus  it  will  be  enough 
to  prove  that  sup  {trdeg  B  }  =  n  <  °°  implies  B  -  B  for  all  large  t . 
Since  each  e    |b  :  B ,  -» B   is  onto,  the  integers  trdeg  B   form  an 

S  ■  "C  Xi         "C     S  ~t 

ascending  chain,  bounded  above  by  n.  So  trdeg  B  =  trdeg  B  .,  =  . . . 

r         r+1 

for  some  r.  By  {k.2),      e    |b   is  an  isomorphism  for  each 

t  >  r.  So  by  (5-21),   R ,  =  R  .  Therefore  R  =Hr  =  [0]  and 
—  '   t    r  r      t 

e  |B:  B  -  B   for  all  t  >  r. 

6.  Construction  of  0.     and  T. 

We  now  begin  to  define  response  maps. 

(6.1)  DEFINITION.  The  input  space  0,     is  the  k-space  X(Y). 

To  verify  that  0  is  well-defined  according  to  the  setup  developed 
in  Chapter  II,  we  remark  that  Y  is  k-reduced.  Indeed,  ker  e  is  a 
closed  ideal  for  each  t,  since  k[  £  ,  ...,  |  ]  is  k-reduced.  So  by 
(5.20)  the  ideal  {0}  is  closed,  i.e.  ¥  is  k-reduced. 

(6.2)  DEFINITION .  The  space  of  input  values  is 

m 
U  :=  k  . 

The  algebra  of  polynomial  functions  on  the  k-space  U  is 
k[T, ,  ...,  T  ].  Therefore  the  algebra  A(U  )  of  polynomial  functions 
on  the  t-fold  product  of  k- spaces  U  =  U  x  . . .  x  U  is  the  ring  of 
polynomials  in  mt  variables.  So  we  may  denote  A(U  )  by 

k[  !.. ,  ...,  |.  ]  .  We  adopt  the  notational  convention  of  writing  the 

t 
sequences  (u.. ,  u  ,  ...,  u  )  in  U   in  an  inverted  order  (u,,  ...,  u  ). 

Thus,  the  coordinate  function  I. .  acts  on  elements  of  U   by 

ij 

5,  .(u  ,  ...,  u  )  :=  u. .  =  i-th  entry  of  j-th  vector  counted  from  the 
id  s       1      ij 

right  (for  instance,  if  m  =  1,  then  £12(0,  1,  0,  0,  0)  =  0, 
I  , (0,  1,  0,  0,  0)  =l).  This  notation  will  be  consistent  with  the 
following  interpretation:   (u.,  ...,  u. )  represents  an  input  sequence 
such  that  u .  is  the  input  at  time  1  -  j . 
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Using  this  notation,   e.   gives  rise  to  a  closed  embedding 

t  s    t 

i.  :=  X(e, ):  U  -» fi.  Similarly,  each  i    :=  X(e   ):  U  -^  U   is  a 

t         t  S  ,  "t         S  ,  0 

closed  embedding,  mapping  sequences  (u  ,  . . . ,  vl  )  of  u   into 

t   s 
(0,  . . .,  0,  u  ,  . . .,  u2,  u  )  in  U  .  By  (5 .21)  the  following  diagram 

commutes : 


Therefore  the  sets  i,  (U  ),  t  >  0,   form  an  ascending  chain  in  0,    whose 
union  (limit)  may  be  identified  with  the  set  of  all  infinite  sequences 
(  ...,  u  ,  ...,  u  ,  u  )  with  finitely  many  nonzero  entries;  the  rule 

(6.k)        (    ...,    0,   ut,    ...,   ui)~utzt"1+    ...   +ur 

permits  identifying  this  union  with  the  set  of  polynomials  over  some 
symbol,  say  z,  with  coefficients  in  U  =  k  .  Thus  we  have  the 

(6.5)   NOTATION.  U[z]  :=  l^  ^(U*)- 

We  shall  denote  by  (0)  the  sequence  with  u =  0  for  all  t. 


(6.6)       LEMMA.  U[z]  =  n. 

PROOF.  Clear  by  (2.7-f)  and  (5. 20). 

We  may  regard  fi  as  a  "completion"  of  U[ z] .  This  will  allow  us  to 
endow  U[z]  with  the  geometric  structure  carried  by  a  subset  of  the 
k- space  n. 

There  will  be  no  danger  in  identifying  U   with  i  (U  ),   so  that  we 
may  think  of  U   as  the  closed  subset  of  U[z]  corresponding  to  the 
sequences  (  ...,  0,  u,,  ...,  u  ). 

Now  let  K  be  the  field  obtained  from  k  by  adjunction  of  dennumera- 
bly  many  new  indeterminates  S. .,   i  =  1,  ...,  m,  j   =   1,  2,  . . .  . 
Applying  (5-l8)  with  r   :=  S  ,  we  define 

1J         J- J 
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(6.7)  0t   :     *  -**®  ktS^    ...,   St]:  ilrH+MS^    ...,   St). 

By  duality,   there  is  a  polynomial  map 

(6.8)  5      :=  X(0t):    a  X  U     ->  fl. 

Thus  6^  is  the  map  whose  transpose  A(8.)  =  0.  .  We  claim  that  the 

t  +  Z  T, 

polynomial  action  of  U   upon  fi  given  by  B   is  the  natural  extension 
to  H  of  the  "concatenation"  maps 


(6.9)   Bs  t:  US  X  if  ->U   :  ((us,  ...,  ux),  (ut,  ...,  \))  m 

M<U,  ...,  U1?  Ut,  ...,  Ux). 


In  other  words,  we  have: 

(6.10)  LEMMA.  The  following  diagram  commutes  for  each  s,  t: 

S 


if  xtf 


s,t 


i  x  1 


♦  t 

a  x  u 


us 

i 

r 

a. 


+t 

s+t 


PROOF.  By  duality,  it  is  necessary  and  sufficient  to  verify  that 
the  following  diagram  commutes: 

A(6t) 


:s+t 


A(5s,t> 


e  [Bx,  ...,  St] 
T 
k[lr   •-,  6S][S1,  •-.,  St]. 


Note  that,  in  the  coordinates  displayed,  A(8g  t)  is  given  by  1^  i-»  S^ 
if  J  =  1,  ...,  t,   and  by  %±     w  |±  ,  t  if  5  >  t.  Thus  the  diagram 
commutes  by  definition  of  e  ,   es+t>   ef 

In  view  of  (6.10),  there  will  be  no  danger  in  denoting  the  operations 

5   ,  as  well  as  5. ,   for  all  s  and  t,   simply  by  concatenation 
s,t'  t 
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(6.11)  cw  :=  5  (cc,  v)  for  co  in  ft,  v  in  U  . 

Let  f  be  a  polynomial  function  ft  -»k.  Since  A(ft)  =  t(Y)  -  ¥, 
clearly  f  can  be  identified  with  a  Volterra  series  ty  =  ty„.  So,  by 
(3.8),  tlu*  =t°it, 

A(*«it)(T)  =  (A(it)oA(t))(T)  =  A(it)(t)  =  et(lr); 
thus  (with  the  notation  in  (5.IU)) 

(6.12)  tlu*:  (ut,  ...,  ux)  •^t(u1,  ...,  ut,  0,  ...  ). 

Since  U[z]  is  the  increasing  union  of  the  U  ,  a  map  U[z]  ->  X 
is  specified  by  its  restrictions  to  the  U  .  In  particular,  take  \|f  in 
¥.  Then  by  (6.12)  the  value  of  t|U[z]  at  (  ...,  ufi,  ...,  u^  is 
obtained  by  evaluating  the  power  series  i     at  |. .  :=  u   =  i-th  row  of 
u..  This  evaluation  is  well  defined  because  almost  all  u.  are  zero. 
A  continuous  function  is  already  determined  by  its  values  in  a  dense 
subset,  so  by  (6.6)   the  assignment  i|r  h*  f  |U[z]   is  one-to-one. 

Thus  the  following  mild  abuse  of  terminology  is  justified: 

(6.13)  DEFINITION.  A_polynomial  map  l\   U[z]  ->  X,  where  X  is  a 
k-space,  is  the  restriction  of  a  polynomial  map  l    :  ft  ->X. 

The  gist  of  the  introduction  of  ft  is  that  the  abstract  set  of 
input  sequences  U[z]   can  now  be  exhibited  as  a  dense  subset  of  the 
k-space  ft  and  thus  U[z]   is  itself  endowed  (by  restriction)  with 
coordinates  and  polynomial  functions. 

Thus  the  polynomial  functions  f:  U[z]  ->  k  are  in  a  bi«j§dti¥§ 
correspondence  with  Volterra  series  t,   via  evaluation  of  ty     at 
f   :=  u. ..  More  generally,  a  polynomial  map  f:  U[zl  -» lr  is  uriiduely 
determined  by  the  functions  7r.°f:  U[l]  -» k,  where  t ,,      jj  =  lj  ai)  3 
are  the  natural  projections  kP  -»  k;  So  polynomial  maps  f:  ti[2l  -4  k 
are  in  a  bijeetive  correspondence  with  ¥P,  the  ordered  p-tuples  of 
Volterra  series. 
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(6.1k)     DEFINITION.  The  space  of  output  values  is  Y  :=  )<? . 
Thus, 

(6.15)  ¥  ~  polynomial  maps  U[z]  -»Y. 
Finally,  we  define 

■^        -4- 
U  :=  Un  U    (disjoint  union). 

This  set  should  not  be  confused  with  U[z~],  the  set  of  finitely  nonzero 

■x- 
sequences,  which  was  obtained  as  a  quotient  set  of  U  ,   via  the 

identifications   (u,,  ...,  u  )  ~  (0,  ...,  0,  u  ,  ...,  u  ).  The  unique 

element  of  U   is  denoted   (0). 

(6.16)  DEFINITION.   The  output  space  T  is  the  set  Y    of  all  maps 
* 

U  ->Y. 

-* 
Thus  an  element  of  P  is  an  U  -indexed  sequence  of  elements  of  Y. 

By  (1.9)  and  (3.8),   r  is  a  k-space,  the  product  of  the  k-space  Y 

with  itself,   U   times. 

7.  Abstract  Response  Maps  and , Systems . 

For  any  vector  space  V  over  k  we  consider  the  set  of  sequences 
Z  -» V  with  support  bounded  on  the  left : 

V  :=  {u:  Z  -»V  I  (3  t  )(u(t)  =0  if  t  <  t  )}. 

The  shift  operator  cr   =  cr  :  Y  -*  Y  is  defined  by 

(cru)(t)  :=  u(t  +  l)   for  all  t  in  |. 
In  particular  we  shall  call 

U  :-   set  of  input  sequences; 

Y  :=   set  of  output  sequences. 
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(7-1)   DEFINITION.  An  input /output  map  £  is  a  map  f:  U  ->  Yj  f  is 
called 

(a)  (strictly)  causal  iff,  for  all  u,  u  in  U,   and  for  all 
x  in  Z,  u(t)  =  u(t)  for  t  <  t  implies  f(u)(t)  =  f(u)(t)  for 

t  <  t; 

(b)  constant  structure  (or  shift-invariant)  iff  f(cru)  =  crf(u) 
for  all  u  in  U. 

An  input /output  pair  of  f  is  a  pair  (f(u),  u),  u  in  U. 

An  input /output  map  can  be  regarded  as  an  abstract  description  of 
the  external  behavior  of  a  "black  box"  (physical  device,  computer,  etc.) 
operating  at  discrete  instants   . ..,  -2,  -  1,  0,  1,  2,    ...      of  time. 
Constant  structure  means  that  this  behavior  is  invariant  under  time 
shifts . 

Rather  than  working  with  the  input /output  map,  it  is  technically 
more  convenient  to  work  with  the  response  map  of  the  same  "black  box", 
i.e.,  with  the  description  of  outputs  resulting  immediately  after  the 
application  of  finite  sequences  of  inputs. 

(7.2)   DEFINITION.  A  response  map  f  is  a  map 

f:  U[z]  ->Y. 

Let     r\:  Y  ->Y:   y  ny(l);     then  the  assignments 

f  »-»f  :=  Ti°fju[z], 
and 

f  ^f(u)(t)    :=  f(a*u), 

establish  a  bisection  between  causal  constant- structure  input /output  maps 
and  response  maps. 

In  the  above  formulas  we  are  implicitely  identifying  U[z]  with  a 
subset  of  U,  via  the  rule 
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T-l 

u  z    + 

T 


where 


u(-  t)  :=  u     if  0  <  t  <  t  and  u(t)  =  0  otherwise. 


In  view  of  the  above  bisection,  we  shall  state  most  of  our 
definitions  and  results  in  terms  of  response  maps.  Only  when  dealing 
with  input /output  equations  shall  we  refer  again  directly  to  input/ 
output  maps. 

The  following  abstract  definitions  and  notations  are  mostly  well- 
known.  They  belong  to  "general  system  theory";  no  structure  is  imposed 
on  systems  or  preserved  by  response  maps.  Later  we  shall  refine  these 
definitions  by  adding  suitable  algebraic  structure. 

(7.5)   DEFINITION.  An  abstract  (constant  structure)  system  Z  is  an 
object  (X,  P,  h,  x  ),  where 

(a)  X  is  the  state  set; 

(b)  P:  XXU-»X  is  the  transition  map; 

(c)  h:  X  -»Y  is  the  output  map;  and 

(d)  x   in  X  is  the  initial  state,  and  satisfies  P(x  ,  0)  =  0. 

The  t-th  iterate  of  P  is  the  map  P^  ':  X  X  U  -»X  defined 
recursively  by 

P<°>  :-  h,     pW  :.  P, 

P(t+1,U,  (ut+1,  ...,  Uj»  :=P(P(t)(x,  (ut+1,  ...,  u2)),  ux). 

The  reachability  map  of  £  is 

g:  U[z]  -X:  u^*1  +  ...  +u1^P(t)(xtt,  (ut,  ...,  U]_)); 

the  t-step  reachability  map  is  g.  :=  g|u  ;  the  t-step  reachable  set 
is  Xt  :=gt(^). 

For  each  w  in  U  ,  t  >  0,  the  observable  map  induced  by  w  ijs 
hW:  X  -»Y:  x  wh'^'d,  w). 
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w       w 
The  basic  observables  of  Z  are  the  functions  h.  :=  7T  °h  , 

j  =  1,  ...,  P,  w  in  U  .  The  observability  map  of  Z  jis 
hr:X-*r:iH  (hW(x),  w  in  U  }. 

Z  is  called 

reachable  iff  g  is  onto; 

r 
observable  iff  h   is  one-to-one; 

abstractly  canonical  iff  both  reachable  and  observable. 

The  response  of  2  is  fz  :=  hog.  Given  a  response  f,  Z  realizes 
f  iff  f  =  fs. 

(7.1+)   DEFINITION.  Consider  abstract  systems  Z,  Z.  An  abstract 
system  morohism  T:  Z  ->  Z  is  given  by  a  map  T:  X  ->X  such  that: 

(i)  T(x*)  m  x9,     and 


commutes . 

Note  that  the  existence  of  a  system  morphism  T:  Z  -> Z  implies 
that  f„  =  fs;  this  is  proved  by  a  trivial  induction. 

We  shall  call  Syst  ,   the  category  consisting  of  abstract  systems 
=abs 
as  objects  and  morphisms  defined  as  in  (7-^)- 

Analogously  with  concepts  related  to  Z,  the  same  concepts  can  be 
defined  directly  in  terms  of  the  response  map: 

(7.S)   DEFINITION.  Let  f  be  a  response  map.  The  observable  map  of 
f  induced  by  w,  where  w  is  in  U  ,  is 
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fW:  U[z]  ^Y:  v  ->f(vw). 

The  basic  observables  of  f  are  the  functions  f .  :=  tt  °f, 

— ===== == ===== #        j     -, 

j  =  1,  . . . ,  p,  w  in  U  .  The  observability  map  of  f  is 

fr:  U[z]  -»r:  v  h»  {fW(v),  v  in  U*). 

w 
Thus  the  value  of  f   is  the  result  of  an  input /output  experiment 

in  which  the  output  is  observed  at  the  end  of  the  application  of  the 

concatenated  input  vw,  v  =  given,  w  =  chosen.  The  collection  of 

results  of  all  such  experiments  is  r  . 

(0) 
Since  fKYJ  -   f,  clearly 

f  =  f  iff  f r  =  fr. 

In  terms  of  the  new  notations,  the  definition  of  realization  may  be 
restated  as  follows.  For  any  system  S  and  any  w  in  U  ,  v  in  U  , 

fj(w)  =  fz(wv)  =  hog(wv)  =  hop(s+t)(xtf,  wv)  = 


=  h°P^(g(W),  V)  =hVog(W), 

r   r 

so  f „  =  h  °g.  Thus, 

(7.6)  Z     realizes  f  iff  fr  =  hr°g. 

We  now  restate  in  our  formalism  largely  well  known  facts  (see,  for 
instance,  EILENBERG  [191k,   Chapter  XII]). 

(7.7)  LEMMA.  Let  Z  =  (X,  P,  h,  xft)  and  £  =  (X,  P,  h,  x8)  have  the 
same  response  map.  Assume  that  2  is  reachable  and  that  Z  is 
observable.  Then  there  exists  a  unique  map  T:  X  -» X  such  that  the 
following  diagram  commutes: 


U[z] 
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This  unique  T  induces  an  abstract  system  morphism  Z  ->  2. 

PROOF.  We  first  prove  uniqueness.  If  such  a  T  exists,  then 
hr(T(g(w)))  =  hr(g(w))  for  all  w  in  U[z],   so  T(g(w))=g(w)  by 
observability  of  Z.  Since  Z  is  reachable,  every  state  in  X  is  of 
the  form  g(w).  Thus  there  is  a  unique  choice  of  T, 

(*)     T:  g(w)  »->g(w). 

To  prove  existence,  we  need  to  see  that  T  as  in  (*)  is  a  well- 
defined  map.  In  other  words,  if  g(w)  =  g(v),  it  should  follow  that 
g(w)  =  g(v).  But  hr(|(w))  =  hr(g(w))  =  hr(g(v))  .  hr(g(v))  implies 
g(w)  =  g(v)  by  observability  of  Z. 

We  now  prove  that  T  is  an  abstract  system  morphism.  Since 
T(x8)  =  T(g(0))  =  g(0)  =  x*,  we  are  only  left  to  prove  that 
T(P(g(w),  u))  =  P(T(g(w)),  u)  for  every  w  in  U[z]  and  every  u  in 
U.  But  T(P(g(w),  u))  =  T(g(wu))  =  g(wu)  =  P(g(w),  u),  as  required.         C 

(7.8)   THEOREM.  I«t  f  be  a  response  map.  Then  f  has  an  abstractly 
canonical  realization  E  ,  If  E  is  any  other  abstractly  canonical 
realization  of  f  then  there  exists  a  unique  isomorphism  T:  Z   ->  Z. 

PROOF.  Abstractly  canonical  realizations  Z  correspond  to 
factorizations  r  =  g„oh  ,  where  g„  is  onto  and  hi  is  one-to-one. 
We  first  prove  the  uniqueness  part.  Given  any  two  abstractly  canonical 
systems  Z  and  Z,  there  exist  by  (7.7)  T  :  Z  -» Z,  T  :  Z  -» Z.  Since  T  °T 
is  an  abstract  system  morphism,  it  follows  by  the  uniqueness  part  of  (7-7) 
that  T°T  =  identity  abstract  system  morphism  Z  ->  Z.  Similarly, 
T  °T   is  the  identity  Z  -» Z.  Thus  T-,  Tp  are  inverse  abstract  system 
isomorphisms . 

For  the  existence  part,  simply  take  X   :=  f  (U[z])  C  r, 


x*  :=  f  (0),  h   :=  projection  onto  (0)-th  factor,  and 

c  p  acp 

P  (r  (w),  u)  :=  fMwu).  Note  that  P    is  well  defined,  since 

acv  v  "      '  K     '                               ac              ' 

fr(w)  =  r(w)  implies  that  for  any  u,  v,  fv(wu)  =  f(wuv)  =  fUV(w)  = 
=  f  (w)  =  f  (wu). 
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8.  Polynomial  Response  Maps  and  k-Systems . 

We  begin  now  to  study  response  maps  which  are  polynomial  (Definition 
(6.I3)).  The  input /output  map  associated  to  a  polynomial  response  map 
will  be  called  a  polynomial  input /output  map. 

The  notion  of  polynomial  response  maps  is  new,  as  is  the  following 

(8.1)  DEFINITION.  The  abstract  constant  structure  system  E  =  (X,  P, 

,- 

h,  x  )     is  a  k- system  iff  X  is  a  k-  space  and  both  P  and  h  are 
polynomial  maps. 

The  terminology  "k-system"  is  just  a  convenient  short  name  for  our 
systems.  In  no  way  is  it  intended  to  imply  that  (8.1)  constitutes  the 
most  general  class  of  systems  definable  in  terms  of  the  field  structure 
of  k. 

(8.2)  LEMMA.  The  reachability  map  g:  U[z]  ->  X  of  a  k-system  E  is 
a  polynomial  map. 

PROOF.  By  definition  (6.I3),  we  need  to  find  a  polynomial 
extension  g  :  0.  -» X  of  g.  Dually,  we  construct  a  homomorphism 
A(g^):  A(X)  -»¥  such  that  et°A(gfi)  =  A(gt)  for  each  t  >  0. 

Pick  any  a  in  A(x).  Then  A(g. )(a)  =  a°gt  belongs  to 
k[|,,  ...,  i.]  for  each  t  >  0.  We  define  a  power  series  t   in  the 

1         Tt  — 

variables     | .  .     by 

i|ra  :=  coefficient  of     |a    in    A(g.    .)(a), 
t*  \cc\ 

for  each    a    in    A. 

a  a 

Now  we  claim  that  deg.  .  ^  <  °°  for  each  i,  j,   i.e.,  t|t   is 

a  Volterra  series.  Take  any  i,  j.  Let  t  >  j.  From  the  definition  of 

p'*^  it  is  clear  that 

p(t)  m  p(j).(p(t-j)  x  1^)> 


As  gt  =P(t)(x»,  •), 
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aogt  =a.P(d).(P(t-j)(x*,-)x  1^). 


Thus  the  degree  to  which  |. .  appears  in  a°g. ,  for  any  t  >  j,  is  at 

1J        (i)  ~  a 

most  the  degree  of  the  polynomial  a°PVt"  in  the  variable  g.  ..  So  i|r 

is  a  Volterra  series. 

Let  A(g  )  :=  a  h*  \|r  .  By  construction,  e.i|r  =  A(g^)(a)  for  each 
a  in  A(x)  and  t  >  0.  Since  each  A(g,  )  is  a  homomorphism,  A(g  )  is 
also  a  homomorphism.  [ 

The  following  result,  the  main  of  this  section,  suggests  that 
k-systems  are  the  natural  realizations  of  polynomial  response  maps.  This 
will  be  confirmed  later  by  the  result  on  canonical  realizations.  We  shall 
reserve  the  name  "polynomial  systems"  for  a  special  type  of  k- system  in 
which  a  strong  finiteness  condition  holds,  which  will  allow  P  and  h  to 
be  represented  by  actual  polynomials. 

(8.3)   THEOREM.  The  response  map  f  is  polynomial  if  and  only  if  f  is 
realized  by  some  k- system. 

PROOF,  ["only  if"]  The  free  realization  Z    (f)  := 
(0,  5.,  f  ,  (0))  is  a  k-system  realizing  f. 

["if"].  Let  f  =  f „  for  some  k-system  2.  Define 


Z 


f     :=  h°g  . 


Then  f  |u[z]  =  hog  |u[z]  =  h°g  =  f .  So  f  is  polynomial. 

Properties  of  X  serve  to  classify  k-systems. 

We  shall  say  that  S  is  a  polynomial  system  [ respectively  almost 
polynomial]  iff  X_  is  a  variety  [respectively  an  almost 
variety] . 

An  irreducible  Z  is  one  for  which  X„  is  irreducible. 

Similarly,  we  define  (recall  Section  (U)) 

dim  Z  :=  dim  X„. 
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The  polynomial  functions  X„  -» k  are  the  costates  of  Z. 

The  notion  of  abstract  system  morphism  is  too  weak  to  serve  for 
comparing  k-systems.  A  suitable  category  Syst  of  k- systems  is 
obtained  with  morphisms  as  in  the  following 

(8.U)   DEFINITION.  An  abstract  system  morphism  T:  Z  ->  Z  between  two 
k-systems  is  a  k-system  morphism  iff  T:  X  ->X  is  a  polynomial  map. 
T:  Z  ->Z  is  dominating,  a  closed  embedding;,  etc.,  iff  T:  X  ->X  has 
the  corresponding  property;  Z  dominates  Z  iff  there  exists  a 
dominating  T:  Z  ->Z;  Z  is  a  closed  subsystem  of  £  iff  there  exists 
a  closed  embedding  T:  Z  ->Z. 

It  is  easy  to  see  that  k-systems  form  a  category  with  the  above 
notion  of  morphism.  Note  that  a  k-system  isomorphism  T:  Z  -  Z  is 
the  same  as  a  polynomial  change  of  coordinates  in  the  state  space. 


9.  ftuasi-Reachability. 

Erom  here  until  the  end  of  Chapter  IV,  f  is  an  arbitrary  response 
map  and  Z  =  (X,  P,  h,  x")  is  an  arbitrary  k-system. 


The  reachable  set  of  Z  is 

x^gcufz])^^)  =ty0xt. 

(9.1)  DEFINITION.  Z  is  quasi-reachable  iff  1CR  =  X. 
By  (6.6)   and  (3. 11)  we  have  the  following 

(9.2)  LEMMA.  The  following  statements  are  equivalent: 

(a)  Z  is  quasi-reachable. 

(b)  g   is  dominating. 

(c)  A(g  )  is  one-to-one. 

(9.3)  LEMMA  (SONTAG  and  ROUCHALEAU  [1975]).  Xt  =  Xt+1  for  some 
t  >  0  implies  X.  =  X_. 
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PROOF.  Since  P  is  polynomial,  it  is  continuous;  thus 

xt+2  =  p<xt+i  x  u)  £  p(xt+i  x  u)  =  p(xt  x  u)  £  p(xt  x  u)  =  xt+r 

Since  clearly  X^  C  X^,  it  follows  that  J^  C  Xt+2  C  ^  =  x^, 
so  X,  .  =  X .  p  and  the  result  follows  by  induction. 

(9-U)   COROLLARY.  If  dim  E  =   n  <  »     then  X  =  5L. 
—  n    R 

PROOF.  Since  g  is  continuous  and  U  is  irreducible  for 
each  t  >  0,  X,  is  irreducible.  By  (U.5),  the  chain  {X,  }  cannot 
have  length  greater  than  n.  So  (9-3)  gives  the  desired  result. 

Thus  in  the  finite-dimensional  case  a  quasi-reachable  system  is 
quasi  reachable  in  bounded  time.  The  analogous  statement  for 


reachability  is  false,  as  illustrated  by  the  following  example.  Take 

tt  p 

k  :=  R,  m  =  p  :=  1,  X  :=  R^  x  :=  0,  P(x,  u)  :=  x  +  u  -  2u,  and 
h  arbitrary.  Then  X  =  {x  in  E  |x>-t)  /L  =X  for  all  t  >  0. 

(9.5)   LEMMA.  E  has  a  quasi- reachable  closed  subsystem  EQ. 

PROOF.  Let  X-  :=  X_.  Since  P  is  continuous,  P(X_  X  U)  C  Xr 
fc£     R  u  H  ~     ^ 

We  may  therefore  define  E  :=  (X_,  P|XQ  X  U,  h|XQ,  xR).  The  inclusion 

of  L  CX  exhibits  EA  as  a  closed  subsystem  of  E. 

10.  Algebraic  Observability. 

As  discussed  in  intuitive  terms  in  KALMAN  [1968,  Chapter  10], 
observability  of  E  means  the  existence  of  a  procedure  for  determining 
the  state  x  of  E  from  data  obtained  by  experiments  of  the  type: 
"apply  an  input  sequence  to  E  beginning  in  state  x  and  observe  the 
corresponding  output  sequence".   In  terms  of  the  basic  observables   {h.) 
introduced  in  (7-3)>  this  informal  description  of  observability  can  be 
made  precise  by  requiring  the  existence  of  a  set  of  arbitrary  functions 
of  experiments 


«  (hWl>A      hWr(A),Ax 
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with  A  =  (A)  some  arbitrary  indexing  set,  such  that  each  state  x  is 
uniquely  determined  by  the  data  {t|»(x)],   . 

When  this  procedure  is  interpreted  in  the  weakest  possible, 
nonconstructive  sense,  the  functions  tw   are  completely  arbitrary  and 
"observability"  reduces  to  the  abstract  definition  (7-3) •  In  "the  case 
of  (finite-dimensional)  linear  systems  over  a  field  this  abstract 
definition  turns  out  to  be  equivalent  to  the  existence  of  linear 
combinations  tk  which  give  every  coordinate  of  the  state;  see 
KAIMAN  [ 1968,  Chapter  10] .  For  linear  systems  over  a  commutative  ring, 
however,  the  abstract  notion  of  observability  is  no  longer  equivalent 
to  the  existence  of  a  linear  procedure;  some  of  the  resulting  problems 
are  studied  in  SONTAG  [ 1976,  1977] .  In  general,  observability  should 
be  formalized  with  reference  to  the  particular  category  over  which  the 
system  in  question  is  defined.  Thus,  in  the  context  of  k-systems 
observability  is  defined  by  requiring  that  each  coordinate  of  the  state 
(i.e.,  every  costate  of  the  system)  be  a  polynomial  in  the  basic 
observables.  This  is  the  definition  given  below,  which  is  a  direct 
generalization  of  that  given  in  SONTAG  and  ROUCHALEAU  [1975]  for 
polynomial  systems.  A  direct  study  of  bilinear  response  maps,  KALMAN 
[1976]  suggests  the  same  conclusion. 

(10.1)  DEFINITION.  The  observation  space  L(Z)  [respectively  the 
observation  algebra  A(Z)]  is  the  linear  subspace  [respectively  the 
subalgebra]  of  A(x)  generated  by  the  basic  observables.  An  observable 
is  a  costate  in  A(Z).  Z  is  algebraically  observable  iff  A(Z)  =  A(z). 
When  X  is  irreducible,  the  observation  field  Q(Z)  is  the  quotient 
field  of  A(Z). 

Consider  the  observability  map  h  :  X  -» V   =  Y    introduced  in  (7-3). 

w  r 

Since  each  h   is  a  polynomial  map,  h   is  also  a  polynomial  map.  By 
(1.9),  A(r)  is  generated  by  the  algebras  A(Y)  appearing  in  the 
coproduct.  So  the  image  A(h  )(A(r))  coincides  with  the  algebra 
generated  by  the  A(hw)(A(Y)),  each  of  which  is  itself  generated  by 
h*  ...,  hW.  We  conclude  that  A(Z)  is  the  image  of  A(r).  The  dual 
of  this  fact  is: 
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(10.2)  LEMMA.  Z  is  algebraically  observable  iff  hT     is  a  closed 
embedding. 

(10.3)  COROLLARY.  If  Z  is  algebraically  observable  then  Z  is 
(abstractly)  observable. 

Algebraic  observability  is  a  stronger  requirement  than  abstract 
observability.  This  is  clear  from  the  counterexamples  given  in  (3. 12). 

We  remarked  in  (9-5)  that  every  k-system  has  a  closed  quasi- 
reachable  subsystem.   It  is  less  trivial  to  prove  the  corresponding 
statement  for  algebraic  observability: 

(10.4)  PROPOSITION.  2  dominates  an  algebraically  observable  system 

zobs. 

PROOF.     Let     i:  A(Z)  ->A(x)     be  the  inclusion  map.     Let 
X°bS   :=X(A(Z)),      x*°bs   :=  A(i)(x#).     Since     A(h)(A(Y))  C  A(x),     we 
may  factor     h:  X  ->Y     as 

X b +j, 

X(i)\      /hobs> 

\vots  obs 

X   ,        for  some  h 

Thus  the  proof  will  be  complete  if  we  can  prove  that  P  induces 
through  A(i)  a  k-system  morphism  Pobs :  X°bs  x  U  -» Xobs ;  then  X(i) 
is  the  required  dominating  k-system  morphism  Z  -» Z°  s.  Therefore  we 
must  show  that 

(10.5)  A(P)(A(Z))  CA(£)[T  ,  ...,  TJ. 

i.e.,  we  must  prove  that  when  an  element  q  of  A(P)(A(Z))  is 

expressed  as  a  polynomial  in  the  variables  T, ,  ....  T  ,  the 

1       m 

coefficients  of  such  a  polynomial  q  are  again  in  A(Z).  Since  the 
algebra  A(Z)  is  generated  by  the  space  L(Z),   statement  (IO.5) 
follows  from  the  following 
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(10.6)  LEMMA.  Let  £.  be  the  subspace  of  A(x)  generated  by  all 
those  hf  with  w  in  U  .  Then 

PROOF.  We  first  observe  that  when  T±,      ..,   ^m     are  specialized 

at  an  u  in  U  =  km,  a  polynomial  in  AfP)!^.  becomes  an  element  of 

f  ,.  Indeed,  denote  by  e(u):  A(X)[T  ,  ...,  T  ]  -»k  the  corresponding 
~t+l  x        m 

specialization.  Then 

e(u)°A(P)°A(hW)  =  A(hW). 

Therefore  e(u)(A(P)L)  C  L,+1,  as  wanted.  Our  claim  follows  from  the 
following  more  general  result  (with  A  =  A(X),  F  =  k[Tr  ...,  Tj  and 
the  {c, )  a  finite  set  of  monomials  in  T^,  . . . ,  Tm) : 

(10.7)  MAIN  LEMMA,  PART  1.  Let  A,  F  be  vector  spaces  over  k,  with 

F  a  space  of  functions  Z  -*  k  for  some  set  Z.  Assume  that  c^  ...,  c 
are  linearly  independent  elements  of  F,  and  let  a^  ...,  an  be  in  A. 
Then  the  linear  subspace  of  A  generated  by 

n 

{ili  ci.(z)ai>  z  in  Z-1, 

coincides  with  the  subspace  generated  by  a^,  ...,  an- 

PROOF.  Clearly  I  cAz)&±     is  in  the  space  generated  by 

a  a  .  It  is  then  enough  to  prove  that  each  a  ,  say  a  ,  can 

1>    '  n  x        x 

be  written  as 

for  some  z, ,  ...,  z  €  X.  Rewriting  this  expression  as 
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we  see  that  it  is  enough  to  prove  the  existence  of  a  A  in  k*1  such 
that  TA  =  (1,  0,  0,  . ..,  0)*,  where 

f  VV     Cl(Z2}   ••'   Cl(zn^ 
T  = 

c  (z..)     c  (z0)   ...   c  (z  ) 
n  1      n  2  n  n 

It  is  therefore  enough  to  find  z.,  . . . ,  z   such  that  T  is 
nonsingular. 

Form  the  n  x  Z  matrix  T  whose  i-th  row  is  c.  seen  as  an 
element  of  k2.  Then  existence  of  T  (as  a  submatrix  of  f)  follows 
from  the  fact  that  rank  T  =  n.  [ 

11.  Existence  and  Uniqueness  of  Canonical  Realizations. 

(11.1)  DEFINITION.  Z  is  canonical  iff  Z  is  quasi-reachable  and 
algebraically  observable. 

We  associate  yet  another  map  to  f .  The  extended  observability  map 
fir 
f  :  fi  ->r  of  f  is  the  observability  map  of  the  system  Z^   (f) 

P  free 

introduced  in  (8.3).  The  observability  map  f  :  U[z]  -» r  introduced 

OP 

in  (7.5)  is  clearly  the  restriction  of  f    to  U[z].  Since  U[z]  is 
dense  in  fi,  we  may  immediately  generalize  (7-6): 

(11.2)  Z  realizes  f  iff  f QT  =  hr°gn. 

(11.3)  LEMMA.  Let  Z  =  (X,  P,  h,  x*)  be  a  quasi- reachable  and 

Z  =  (X,  P,  h,  x  )  an  algebraically  observable  k- system  which  realize 
f.  Then  there  exists  a  unique  k- system  morphism  T:  Z  -> Z. 

PROOF.  Consider  the  diagram 
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o  ^ 

By  hypothesis,  g   is  dominating,  h   is  a  closed  embedding,  and 

h  og  =  f   =  fi  og  .  Thus  by  the  dual  of  (1.5)  there  exists  a 

polynomial  map  T:  X  -»X  making  (11. k)   commutative.  Restricting  to 

U[z]  C  ft,  the  diagram 


U[z] 


commutes.  Thus  we  may  apply  (7-7)  to  the  abstract  systems  (XR,  P|Xr  X  U, 

h|X^,  x*)  and  Z.     We  conclude  that  the  continuous  maps  T°P:  X  x  U  ->X 

and  P°(T  X  1  )  coincide  in  the  dense  subset  X_,  x  U,  so  they  are  equal.  D 
v     u  R 

The  main  result  of  this  chapter  is: 

(11.5)  THEOREM.  Let  f  be  a  polynomial  response  map.  Then  there  is  a 

canonical  k- system  Zf    realizing  f .  If  Z     is  any  other  canonical 

k- system  realizing  f,  there  is  a  unique  k-system  isomorphism  T:  Zf  -> S. 

PROOF.  Uniqueness  is  clear  by  (11. k) .     To  prove  existence,  take 

the  system  Z    (f);  this  is  quasi -reachable  because  g   is  the 

identity.  Applying  (10.4)  we  obtain  the  observable  system 

£.  :=  (Zj,       (f))°bs.  Since  Z.   (f)  dominates  Z„,     the  latter  is  also 
f    v  free  freev  1  a 

quas  i- reachable . 


CHAPTER  IV.   FINITENESS  CONDITIONS 

We  have  shown  in  the  previous  chapter  that  any  polynomial  response 
map  f  is  realizable  by  a  canonical  k-system.  We  now  turn  to  studying 
what  conditions  must  f  satisfy  in  order  that  the  canonical  system  Z 
has  various  finiteness  properties. 

The  main  tool  in  this  study  will  be  three  structures  obtained  from 
the  basic  observables  (7-5)  of  f  by  different  algebraic  operations: 
the  observation  space  L_,  algebra  A^  and  field  Q  .  We  show  that 
the  conditions 

Qu  =   finitely  generated  field  over  k, 

A-,   =  finitely  generated  algebra  over  k, 

L_  =  finitely  generated  vector  space  over  k, 

each  corresponds  to  an  important  characterization  of  Z  . 

We  then  relate  each  of  the  above  conditions  to  the  existence  of 
certain  input /output  equations  for  f . 

We  also  show  how  to  check  the  finiteness  condition  on  CL,  via  a 
Jacobian  criterion.  As  an  application  we  show  that 

y(t)  =  u(t  -  1)  +  u(t  -  2)2  +  ...  +  u(t  -  Z)&  +   ...  , 

has  no  possible  finite-dimensional  realization. 

In  the  final  section  we  discuss  examples  and  counterexamples 
associated  to  the  results  and  constructions  of  the  last  two  chapters. 

We  continue  to  denote  by  f  an  arbitrary  polynomial  response  map. 

12.  The  Observables  of  f. 

Since  Z   is  quasi -reachable  and  ft  is  irreducible,  it  follows  that 
£_  is  irreducible,  so  Q(A(X_))  is  well-defined: 

(12.1)  DEFINITION.  The  observation  space  L   [respectively  observation 
algebra  A„,  respectively  observation  field  CL,]  of  f  is 
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L(Ef)  [  respectively  A(Sf),  respectively  &(£f)] 


Thus 


h*>     Lr>     &f  are  the  sPace>  algebra,  and  field  generated  by 


the  basic  observables  {(ffi)*,  w  in  if,  J  =  1,  ...,  p)  of  sfree(f) 
Each  function  (f  )v.:   fi  ->k:  oj  *->7r.(f(w))  is  already  determined  by  its 
restriction  f.  to  U[z],  which  is  dense  in  Q.  The  restriction 
(f  )*  *-♦  f*  serves  to  establish  the  following  identifications  in  terms 
of  the  basic  observables  of  f  introduced  in  (7.5): 

(12.2)  L-  is  the  subspace  of  k     generated  by  the  basic  observables 

tV,     w  in  U*,  i  =   1,  ...,  p. 

J 

(12.3)  A^  is  the  subalgebra  of  k  '•Zj  generated  by  L^. 

The  f*  can  be  also  viewed  as  maps  U  -» k,  so  one  can  also 
J  U* 

identify  L_  and  A_  with  the  subspace  and  subalgebra  of  k 

w 
generated  by  the  f .. 

J 

Yet  another  representation  of  the  observables  is  obtained  via  the 
(vector)  Volterra  series  ♦  _  =  (ti  ,  ...,  tiP0f  of  f   (see  (6.15)). 

U+.  . ,Ut 

By  the  discussion  in  (5.l6)-(5-17),  the  Volterra  series  of  f^    x  is 
precisely  tf(u.,  •••>  \)'>     coordinatewise: 

(12. k)     The  Volterra  series  of  fut---ul  is  tj.  (\>    ■••»  ut)- 

Thus  L„,  A.,  and  Q.  can  be  interpreted  as  the  space,  algebra,  and 
field  generated  by  the  series  (t^  (u^  ...,  ut),  $   =  1,  ...,  P, 
u  .  .  .u   in  U  }.  Using  this  interpretation,  we  may  define 

(12.5)  deg   f  :=  deg^  *f,  deg  f  :=  deg  tf- 

By  (12.4)  it  follows  that 

(12.6)  deg,,  fW  <  deg,     f,  deg  fV  <  deg  f, 

t 
for  w  in  U  . 

The  observables  are  the  main  system  invariants  in  our  approach.  The 
study  of  Lp,  A»,  &-  will  be  simplified  by  the  consideration  of  various 
chains  which  approximate  them: 


71 


(12.7)  DEFINITION.  The  observability  chains   {L  '  (Z),   t  >  0}  and 
{A0,t(Z),  t  >  0}  of  Z  are,  respectively,  the  subspaces  and  subalgebras 
of  A(Z)  generated  for  each  t  by  the  elementary  observables  h  , 

j  =  1,  ...,  p,  w  in  \f ,     r  <  t.  The  reachability  chains 

{LR,t(E),  t  >  0}  and  {AR,t(Z),  t  >  0}  are,  respectively,  the 

subspaces  and  subalgebras  generated  for  each  t  by  the  restrictions 

h*|x,,  j  =  1,  . ..,  P,  w  in  U  .  The  diagonal  chains   {L  (Z),   t  >  0) 

j  t  . 
and  {A  (Z),  t  >  0}  are,  respectively,  the  subspaces  and  subalgebras 

generated  for  each  t  by  the  restrictions  h . |Xt>  j  =  1,  ...,  p,  w 

in  if,     r  <  t.  When  Z  is  irreducible  there  are  corresponding  chains 

of  fields  Q0,t(Z),  etc.  The  observability.  reachabilityT  and  diagonal 

chains  of  f  are  L0,t  :=  L0,t(Zf),  etc. 

We  write  L0,t,  etc.,  instead  of  L0,t(Z),  etc.,  when  there  is  no 
danger  of  confusion.  In  terms  of  Volterra  series,  we  have: 

(12.8)  L?'*,  ^^C?  are  generated  by  all  t{(\,    ...,  ur),  r  <  t; 

(12.9)  lJ'*,  ^'tCk[|1,  ...,  £t]  are  generated  by  all  €t+f; 

(12.10)  L*,  A^Cktl^  ...,  £t]  are  generated  by  all  et(+f(«1»  •••,  ur^' 
r  <  t. 

Before  proving  some  properties  of  the  chains  just  introduced,  we  need 


the 


(12.11)  MAIN  LEMMA,  PART  2.  Let  A  be  a  k-algebra,  and  take  a  polynomial 
Q  in  A[T  ,  . . .,  T  ],  for  some  r  >  0.  Let  D  be  a  dense  subset  of  k  . 
Then  the  linear  subspace  of  A  spanned  by  {Q(u),  u  in  D}  is  equal  to 
the  linear  span  of  (Q(u),  u  in  k  ). 

PROOF.  We  may  assume  that  Q  4  °-  Write 

Q(TX,  ..-,  Tr)  =  j^c.OV  ...,  Tr)a., 
with  all  c.  in  k[T.,  ...,  Tr]  and  all  a±     in  A,  and  with  s  smallest 
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possible.  Minimality  of  s  implies  that  both  {a  ,  ...,  a  }  and 
{c.,  . ..,  c  )  are  linearly  independent  over  k. 

We  claim  that  the  restrictions  c.|D  are  linearly  independent  as 
functions  from  D  into  k.  Otherwise  there  would  exist  b_,  ...,  b 
in  k  such  that  Z  b.c.(d)  =  0  for  all  d  in  D.  By  continuity  of 

the  polynomial  function  Z  b.c:  k  -»k,  it  follows  that  Z  b. c.  =  0 

r 
on  all  of  k  ,  contradicting  linear  independence  of  the  c. . 

Main  Lemma  (10. 7)  can  then  be  applied  twice,  first  with  Z  :=  D  and 
then  with  Z  :=  kr.  Thus  {Q(u),  u  in  D)  and  (Q(u),  u  in  kr] 
both  span  the  same  space  as  {a.. ,  . . . ,  a  ) .  C 

We  collect  below  some  rather  technical  facts  which  will  be  needed  in 
deriving  the  main  results  of  this  chapter. 

(12.12)  PROPOSITION.  For  any  2  and  f, 

(a)  If  L0,t(Z)  =  L°'t+1(2)  for  some  t  >  0,  then 
L0,t(E)  =  L(S). 

00  II   A0,t(^)  ■  A0,t+1(z)  for  some  *  >  °>    then 

A0,t(E)  =  A(Z). 

(c)  If  A0,t+1(Z)  is  algebraic  over  A0,t(Z)  for  some  t  >  0 
and  if  {P(x,  u),  x  in  X,  u  in  U]  is  dense  in  X  (e.g.,  if  Z 
is  quasi -reachable),  then  A(Z)  is  algebraic  over  A  '  (Z). 

(d)  If  A°,t+1(Z)  is  integral  over  A0,t(Z)  for  some  t  >  0, 
then  A(Z)  is  integral  over  A  '  (Z). 

(e)  L,/   is  finite  dimensional  and  A_'   is  a  finitely 
generated  algebra  for  all  t  >  0. 

(f)  If  dim  Z  =  n  then  dim  Zf  <  n. 


(g)  Let  dim  Zf  =  n.  Then  en|Lf:Lf  -  L^'n,   and  €n|Af:Af  -  A^'n. 
(h)  dim  Z  =  sup  {trdeg  Af). 

PROOF,   (a)  By  (10.6), 
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(12.13)     A(P)(L°'r)   =  A(P)(0iJ£r  4)   -  ^  A(P)(^)  C 

CL°'r+1[Tl,    ...,    TJ, 

for  all     r  >  0.     By  the  argument  in  (10.6),     L^     is  spanned  "by  the 
coefficients  of  the  polynomials  in 

A(P)(Lt+1)CA(P)(L°'t+1)   =A(P)(L°>t)CL°>t+1[T1,    ...,    Tj , 

0,t+l 
(using  (10.3)).  Thus  L.  „  is  spanned  by  elements  of  L     ,  i.e 

L0C  L°'t+1.  Then 

T0,t+2   T0,t+1 
L       =  Jj      , 

and  (a)  follows  by  induction. 

(b)  Analogous  to  (a). 

w  t+1 

(c)  Take  any  elementary  observable  h-j  with  w  in  U 

Since,  by  hypothesis,  Ik  is  algebraic  over  A0'*,  there  exists  an 
s>0  and  a  polynomial  L(Tq,  ...,  Tg)  in  k[TQ,  ...,  Tg]  with  the 
properties : 

(D   Kh*  h^i,  ...,  h!»)  ,0 

for  some  elementary  observables  h^1,  . ..,  tus,  v   in  IT1,  r  <  t,  and 

Jl        s 
(ii)  if  L(T1}    ...,  Ts)  is  the  leading  coefficient  of  L  expressed  as  a 

polynomial  in  T  ,   then 

(12.11+)  L(h^l,  ...,  h^)  ^0. 
Jl       Js 

If 

L(hUV1(x),  ...,  hfs(x))  =  £(h^(P(x,  u),  ...,  hI«(P(x,  u))  =  0 


7U 


for  all  u  in  U  and  x  in  X,  the  density  hypothesis  leads  to  a 
contradiction  of  (12. Ik).     So 

D  :=  (u  in  U  |  £(h^,  ...,  hUVs)  +   0}  +   0, 
Jl        Js 

and  toy  (2.U),  D  is  open,  hence  dense.  Take  u  in  D.  Then 

L(h™,  hTl,  ...,  h7s)  =  L(h>(-,  u)),  ...,  h^(P(.,  U)))  =  0, 

and  the  equation  is  nontrivial,  i.e. 


£(0,  ...,  h7«)^o, 


*i       '   K 


toy  definition  of  D.  Thus  the  elementary  observatoles 


{h™,  u  in  D,  j  =  1,  ...,  p), 

J 


are  algebraic  over  A°,t+1.  Let  Q(TX,  ...,  \)     toe  the  polynomial 
h*°P  considered  as  a  polynomial  in  T^  ...,  Tm  (input  variatoles)  with 
coefficients  in  A(x).  We  showed  atoove  that  the  elements  Q(u),  u  in  D, 
are  algebraic  over  A°,t+1.  By  (12. 11),  the  span  of  (Q(u),  u  in  km] 
coincides  with  the  span  of  (Q(u),  u  in  D}.  Thus  every  generator 
Q(u)  =  hY*  of  A0,t+2  is  algetoraic  over  A°'t+1  for  all  uw  in  U 


..t+2 


The  result  follows  toy  induction  on  t. 


(d)  This  is  analogous  to  (c) 

(e)  Since  aJ'*  is  generated  toy  L°*  =  span  of  L^'  ,  . ..,  L^  , 

^  A  a     crart£ 


\B)       Since      £»  -LB    gcireiaocu    uy        jjj,  0±««i    ~±        -^       ,      ...,     ~£ 

it  is  enough  to  prove  that  each  L^'0  is  finite  dimensional.  But,  toy  the 
Main  Lemma  (10.7),  L^'*  is  generated  toy  the  coefficients  of 
♦Jd)(8lf  ...,  St). 

(f)  By  (9.5)  and  (10. h),     2  has  a  subsystem  Z   which  dominates 
E   (i.e.  Z   is  a  "subquotient"  of  S).  Thus  (e)  follows  from  (U.2). 

(g)  Apply  (9.3)  to  Zf;  observe  that  A(gn)  =  6n- 
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(h)  By  (c)  above  applied  to  £f, 

0,t 

dim  E,  =  sup  {trdeg  A   }. 

The  result  is  then  clear  by  (5-22)  applied  to  each  A^'  . 

15.  Finite  Realizability  and  Minimality. 

(15.1)  DEFINITION,  f  is  finitely  realizable  iff  f  has  a  finite 
dimensional  realization. 

We  collect  various  characterizations  of  finite  realizability  in  the 
following 

(13.2)  THEOREM.  The  following  statements  are  equivalent; 

(a)  f  is  finitely  realizable. 

(b)  dim  Sf  <  °°. 

(c)  E   is  an  almost-polynomial  system. 

(d)  Q,   is  a  finitely  generated  field  extension  of  k. 

(e)  e-.   A.  =*  A?'*  for  some  t  >  0. 

v       t   ~*T    ~I  — — —     — 

(f)  a*.  =sj,t   for  some  *  >  °- 

(g)  a   is  algebraic  over  A^'   for  some  t  >  0. 


PROOF.  We  prove  (a)  =*  (b)  =*  (e)  =*  (c)  **  (d)  =*  (g)  =*  (f)  =>  (b)  => 


(a). 


.t< 


(a)  =>  (b)  This  is  (l2.12e). 

(b)  =»  (e)  This  is  (I2.12f). 
(e)  =*  (c)  By  definition,  AR,t  is  a  subalgebra  of 

a(ux)  =k[|r  ...,  it3. 

(c)  => (d)  Let  A„  be  included  in  the  finitely  generated 
algebra  k[b1,  ...,  bg] .  Then  0^  is  included  in  the  finitely 
generated  field  k(bj  ...,*,).  By  LANG  [1965,  Chapter  X,  Exercise  61, 
CL,  is  itself  finitely  generated. 
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(d)  =>  (g)  Clear  from  Sf  =  ^  Sf 


0,t 


(g)  =>  (f)  If  A   has  the  same  quotient  field  as  kj    ,     then 

0  t 
A-  is  obviously  algebraic  over  hJ    . 

(f)  =>  (b)  Clear  from  (l2.12e) 

(b)  =*>  (a)  Trivial.  C 

We  consider  briefly  the  topic  of  minimal-dimensional  realizations: 

(I3.3)   DEFINITION.  A  system  Z  is  minimal  iff 

dim  Z  <   dim  Z 

for  any  Z  with  f£  =  f„. 

By  (I2.12f),  Z  is  minimal  iff  dim  Z  =  dim  Z  . 
iz 

For  any  x  in  X  we  define  the  observability  class  of  x, 

(13.1+)   Obs  (x)  :=  {z  in  X  I  hF(z)  =hr(x)}. 

We  are  interested  in  the  case  in  which  Obs  (x)  is  generically  finite: 

(13.5)   DEFINITION.  Z  is  weakly  observable  iff  there  exists  an  open 
dense  subset  U  of  X  and  an  integer  s  >  0  such  that  Obs  (x)  Hu 
has  cardinality  <  s  for  all  x  in  U.  Z  is  weakly  canonical  iff  Z 
is  quasi- reachable  and  weakly  observable. 

Let  Z  be  a  quasi- reachable  realization  of  f.  By  (10. h) ,   there  is 
a  dominating  morphism  T:  Z  ->  Z  =  (X  ,  P  ,  xL  h  ).  Since  T  is  a 

r    r 

system  morphism,  h  °T  =  h  .  Thus 

hr(x)=hr(z)  iff  hJ(T(x))  =hJ(T(z)), 
iff  (observability  of  Zf ! )  T(x)  =  T(z).  So 

Obs  (x)  =  T"  (T(x))  =  fiber  of  T  through  x. 
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If  either  k  =  R  or  k  is  algebraically  closed,  and  if  Z  is  a  quasi - 
reachable  almost -polynomial  system,  then  (h.6)   implies  that 

(13.6)  dim  Obs  (x)  =  dim  Z  -  dim  Z 

for  all  x  in  some  open  dense  set  U.  In  particular, 

(13.7)  THEOREM.  Let  either  k  =  R  or  let  k  be  algebraically  closed. 
The  almost  polynomial  system  Z  is  weakly  canonical  if  and  only  if  Z 
is  both  minimal  and  irreducible. 

PROOF,  ["if"]  If  Z  is  not  quasi-reachable,  then  Zn  has 
strictly  lower  dimension,  contradicting  minimality.  So  Z  is  quasi- 
reachable,  and  there  exists  T:  Z  ~» Zf   dominating.  Apply  (U.6), 
noting  that  n  =  m.  Let  U  :=  j„(X_);  then  (4.6c)  shows  that  the  sets 
Obs  (x)  PlU  have  a  bounded  cardinality  when  x  is  in  U. 

["only  if"]  Irreducibility  is  a  consequence  of  quasi- 
reachability.  Assume  that  dim  Z^  <  dim  Z.  By  (13.6), 


dim  (Obs  (x)  Ou)  =  dim  (Obs  (x)  Hu)  =  dim  Obs  (x)  >  0 
for  almost  all  x,  so  Obs  (x)Hu  cannot  be  generically  finite.      C 

Ik.   Polynomial  Canonical  Systems. 

We  have  proved  in  the  previous  section  that  "f  finitely  realizable" 
is  equivalent  to  "Z_  is  almost  polynomial".  We  now  ask  under  what 
(stronger)  conditions  Z„  is  a  polynomial  system. 

The  precise  "input /output"  condition  for  Z  =  polynomial  is 
immediate  at  this  stage.  Since  A(Z„)  =  A_,  we  see  that  the  canonical 

I      ~T  

system  Z_  is  polynomial  iff  the  observation  algebra  A_  is  a  finitely 
generated  k-algebra.  Some  useful  characterizations  of  this  latter 
condition  are  given  in  the  next 

(lU.l)   THEOREM.  The  following  statements  are  equivalent: 
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(a)  Zf  is  a  polynomial  system. 

(b)  aJ'*  =  Af     for  some  t  >  0. 

(c)  A-  is  integral  over  A  '   for  some  t  >  0. 

PROOF,  (a)  =5>  (b)  Since 

Lr  =    U  £'*> 
~*       t>0  ~* 

there  is  a  t  >  0  such  that  A-'   contains  all  the  generators  of  A^.. 

(b)  =»  (c)  Obvious. 

(c)=>(a)  A^  integral  over  A®'*  implies  g^  =  ^(A^. )  is 
algebraic  over  QCaJ**),  so  trdeg  0^=  trdeg  j^'*  =  finite  by  (l2.12e). 
By  (b)^>(d)  in  (13.2),  Qf  is  finitely  generated  over  k.  By  (1.13a) 
we  conclude  that  A_  is  finitely  generated.  t 

We  shall  now  describe  a  procedure  which  enables  one  to  obtain  a 
polynomial  canonical  realization  when  a  finite  set  of  generators  of 
A.  is  known. 

Assume  now  that  Af  is  generated  by  t., ,  . . . ,  ^n  as  a  subalgebra 
of  1.     In  particular  trdeg  A^.  <  n,  so  eji^.  is  an  isomorphism. 
Thus  any  relation 

(1U.2)   Vu)=^Pa,i(V  "•>  Vu° 

where  u  =  (u, ,  . . . ,  u  )  •  is  in  U  and  the  P„  .  are  finitely  many 
x  j.       m  01,1 

polynomials,  is  equivalent  to 

€nVu)  =gPa,i(en^  — '  ^n^' 

The  transition  map  of  the  canonical  realization  Ef  is  given  by  X(©1), 
with  0   defined  as  in  (6.7),  e±:  A^   -^[S-J:  *  »-»i|r(S1).  Thus  the 
canonical  realization  may  be  obtained  from  Gn+11lf1,  . .  - ,  en+1*n  as 
follows : 
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STEP  1.  Substitute  g.,  hT.  and  |..h|.  .  n,  1  =  1,  . . . ,  m, 
11    i       io    i,j-l'       '    '   ' 


j  >  2,  in  each  e  ,t. 

—  '  n+1 


STEP  2.  Write  e  ,t.  as  a  polynomial  in  the  variables 
n+1  l 

T  =  T.,  ....  T  : 
1'    '     m 

STEP  5.  Express  each  q.   as  a  polynomial  combination  of 
«n*l'   •-  €nV 

R  n 
this  is  always  possible  by  (10. 5),  since  ei,  •  •-,  entn  generate  Af'  . 

STEP  U.  Since  each  \|£"  is  in  A^,n,  J  «  1,  . ..,  P,  there 

are  polynomials  h.  with 
J 

VfJ?"VeA'  ••"  enV'   3-1,  -..,  P- 

Then  Z»  is  the  quasi-reachable  subsystem  Zq  of  £  =  (k  , 
(Pia),  (hp  ...,  h  ),  0),  i.e.,  2f  is  given  by  the  equations 

'x±(t   +  1)  -Z  ^(^(t  +  1).  •••>  *n(*  +  !))>   xf  :=  °> 
(U4.5)   ^  i  =  1,  ...,  n, 

'y..(t)  =  h  (x  (t),  ...,  xn(t)),   j  =  1,  ...,  p, 

subject  to  the  constraints 

Q(X;L(t),  ...,  xn(t))  =  0, 

for  each  polynomial     Q     for  which     Q(eJV    '">   €iAi^  =  °" 


(11+.6)   EXAMPLE.  A  simple  illustration  of  the  above  procedure  is  the 
Let  f  =  f„,  where  £  is  given  by  m  =  p  =  . 

x(t  +  1)  =  x2(t)  +  u(t),  x*  :=  0,  y(t)  =  x2(t) 


following.  Let  f  =  f„,  where  £  is  given  by  m  =  p  =  1,  X  :=  k  and 
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Thus,  f(ut,  ...,  u^  =  ((  . ..((u£  +  ut_x)"  +  ut_2^  +  •••  +  u2)2  +  \f- 

Note  that  f  =  (f  +  u)  ,  or  equivalently,  t.,(u)  =  (ty     +  u)   for  all 

2      2 
u  in  U.  Thus  i|r_  generates  A  ,  and  n  =  1.  Since  e0i|r_  =  (|  +  |  )  , 

step  1  gives  (^  +  T)  .  But  e1tf  =  |  ,  so 

(lx  +  T)  =  («■,♦,»  +  T)  .  Thus  (12.5)  becomes 

(1U.7)   x(t  +  1)  «  (x(t)  +  u)2,  xtt  =  0,  y(t)  =  x(t), 

with  X„  =  k.  The  procedure  Z  ->  Z   has  identified  the  pairs  of 
unobservable  states  (x,  -  x],  giving  the  algebraically  observable 
system  Z-.  On  the  other  hand,  (complete)  reachability  of  Z  became 
just  quasi -reachability  of  Z-.  A  more  direct  way  of  obtaining  Zf 
in  those  cases  (as  with  the  present  example)  in  which  a  quasi-reachable 
realization  Z  is  already  known  is  through  (10. h) .     By  (12. 12b)  and 
(lU.lc)  we  must  generate  the  algebras  A  '    (Z)  until  A  '    (Z)  =  A   '   (Z) 

for  some  t.  Calculating,  A  '  (Z)  =  algebra  generated  by  h  =  1s[i\   ]; 

0  1  **"    2  2     2 

A  '  (Z)  =  algebra  generated  by  t]   and  by  all  (r\     +  u)  ,  u  in 

~*  2 

k  =  (by  10.7)  algebra  generated  by  r\       and  by  the  coefficients  in  u  of 

i\k  +  2t)2u  +  u2  =  k[T]2  r\k]    =  k[T)2]  =  A°'°(Z).  So  A(Z)  =  A°'°(Z). 

Restricting  A(P)  to  A(Z)  one  obtains  again  (ik.j). 

We  shall  see  in  Section  (1.6)  that  Z^,  is  polynomial  for  a  wide  class 
of  response  maps,  which  includes  all  those  maps  defined  by  recursive 
polynomial  equations. 


15.  Bounded  Maps. 

An  important  role  in  studying  the  condition  dim  L _  <  »  is  played  by 
the  response  maps  introduced  in  the  following 

(15.1)   DEFINITION,  f  is  bounded  iff  deg  f  <». 

(Recall  Definitions  (5.4)  and  (12. 5)  for  deg  f.) 

The  system- theoretic  meaning  of  a  bounded  map  is  that  no  input  is 
raised  to  a  power  higher  than  a  certain  bound.  Very  simple  systems  can 
give  rise  to  nonbounded  maps.  For  instance,  let 
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Z  :  x(t  +  1)  =  x2(t)  +  u(t);  x*  :=  0,  y(t)  =  x(t). 

Computing  f„  by  iteration  shows  that  it  is  clearly  not  bounded. 
However,  the  notion  of  bounded  map  is  general  enough  to  accommodate 
those  classes  of  response  maps  for  which  a  realization  theory  exists  in 
the  literature  today.  These  cases  correspond  to  the  next  three  examples: 

(15.2)  EXAMPLE,  f  is  linear  iff  E  \|f_  =  0  and  each  monomial  in  \|r 
has  degree  =  1.  In  particular,  deg  f  =  1  <  ». 

(15.3)  EXAMPLE  (KAIMAN  [1969,  1976]).  f  is  multilinear  of  order  m 
iff  all  the  nonzero  monomials  appearing  in  \|r «  are  of  the  form 


«£l...£S   a  in  H, 


i.e.  flu*  is  m-linear  as  a  map  U  =  (km)  =  (k  )m  ->  Y,  for  all  t  >  0. 
Then,  deg  f  =  1  <  °°.  (More  generally  one  may  consider  s-linear  maps 

f:  U*  X  ...  X  U*  ->Y,  where  U.  :=  kr±,  i  =  1,  ...,  s  and 

1        s    '         1      ' 

r,  +  . . .  +  r  =  m.  Here  we  took  all  r .  =  1  just  for  notational 
Is  i 

simplicity. ) 

(15.1+)   EXAMPLE  (BROCKETT  [1972],  ISIDORI  and  RUBERTI  [19751,  ISIDORI 
[1975,  197*4],  FLTESS  [1973],  D'ALESSANDRO,  ISIDORI  and  RUBERTI  [197U]). 
f  is  internally-bilinear  iff  all  monomials  appearing  in  i|rf  are  of  the 
type 

a|aill...|aitt,   t  >  0,  a  in  k. 
ill    it* 

Again  deg  f  =  1  <  °° . 

Note  the  difference  between  the  bilinearity  in  (15.*0  with  that  in 
(I5.3)  (with  m  =  2).  Neither  of  these  implies  the  other. 

(I5.5)   THEOREM.  Let  f  be  a  response  map.  The  following  statements 
are  equivalent: 
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(a)  f  is  bounded  and  finitely  realizable. 

(b)  f  is  bounded  and  realizable  by  a  polynomial  system. 

(c)  f  is  bounded  and  22   is  a  polynomial  system  (i.e.,  Af 
is  finitely  generated). 

(d)  f  is  bounded  and  Lf  is  finite  dimensional. 

(e)  dim  Lf  <». 

(f)  f  admits  a  realization  with  X  =  k   and  transitions  of 
the  form 

x(t  +  1)  =  F(u(t))x(t)  +  G(u(t)), 

vhere  F(-)  is  a  matrix,  and  G( • )  a  vector,  of  polynomial  functions. 

(g)  f  admits  a  realization  as  above  with,  further,  h  linear, 
(h)  f  admits  an  observable  realization  as  in  (g) . 

(i)  f  admits  a  realization  as  in  (h),  with  G  =  0. 

PROOF.  The  only  nontrivial  implications  in  the  diagram 

(d)  =*  (e)  =»  (i) 

(a)^    J(c)      >), 

(b)  «- (f )  *=  (g) 

are  (a)  =>  (d),  (e)  =»  (i),  and  (f)  =S>  (a),  which  we  now  prove. 

(a)  =3>  (d)  Let  dim  2  =  n  and  deg  f  =  d.  By  (I2.12g), 
e  :  L  *  L*,n.  By  (12.6),  deg  f w  <  d  for  all  w  in  U  .  So 
deg  ejff   .(w)  <  d  for  all  w  in  U*  and  j  =  1,  . . .  ,  p.  Thus  Lf'n 

is  a  linear  space  generated  by  a  set  of  polynomials  in  nm  variables  of 

R,n   /   \nmd  . 
joint  degree  <  nmd.  Thus  dim  Lf  =  dim  L^'     <  ^nm;    <  °°. 

(e)  =>  (i)  Let  dim  L.  =  n,  and  let  {*-,_,  ...,  ^3  be  a  basis 
of  L  as  a  subspace  of  1.  By  (11. 5)  the  polynomials  qia  introduced 
in  (14.5)  are  in  l£,R.  Thus  each  Pia  in  (ik.k)  can  be  chosen  linear, 
and  (i)  follows  from  the  formulas  (lU-5). 
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(f)  =-,  (a)  Let  F(u)  a  (F. .(u)),  a  matrix  of  polynomials,  and 

let  h(u)  =  (hv    ...,  h  ),  a  vector  of  polynomials.  Denote 

dee  F  :=  sup  {deg  F,  .}  and  deg  h  :=  sup  {deg  h  }.  Then 
i,3      id  J       ° 

deg  f  <  deg  F.deg  h  <  ».  C 

We  can  see  from  either  (f)  or  (g)  above  that  the  essential 
characteristic  of  hounded  maps  is  the  absence  of  any  intrinsic  nonlinear 
feedback. 

The  above  theorem  shows  that  the  problem  of  deciding  whether  a 
bounded  map  is  realizable  is  equivalent  to  checking  whether  a  realization 
as  in  (g),  (h),  or  (i)  exists.  We  shall  study  in  Chapter  V  the  connection 
between  such  special  realizations  and  some  questions  of  automata  theory. 

These  special  configurations  are  highly  appealing  because  they  can 
be  studied  by  linear- algebraic  methods.  It  should  be  noted  carefully, 
however,  that,  for  certain  questions  like  synthesis,  alternative  (lower 
dimensional)  representations  may  be  more  useful.  This  is  illustrated 
by  the  following  example.  Consider  the  one- dimensional  system  2: 

(15.6)   x(t  +  1)  =  x(t)  +  u(t),  x8  :=  0,  y(t)  =  xS(t), 

where  s  >  1  is  an  arbitrary  integer.  Then  dim  fy  =  =  s  and  therefore 
(see  Chapter  V)  representations  as  in  (g),  (h),  or  (i)  have  dimension  at 
least  s.  The  system  2  may  be,  however,  obtained  back  from  any 
observable  realization  2  as  in  (h)  in  the  following  way.  Since  2  is 
canonical  (easy  verification)  by  (11.3)  2  is  a  closed  subsystem  of  2. 
Indeed,  the  reachable  set  of  2  will  be  one-dimensional,  and  the 
restrictions  of  P  and  h  to  this  reachable  set  will  define  a  subsystem 
isomorphic  to  2. 

l6.  Input /Output  Equations . 

We  shall  now  study  the  existence  of  input /output  equations  and 
relate  them  to  various  finiteness  conditions.  We  treat  only  the  case 
p  =  1,  since  when  there  is  more  than  one  output  channel  one  may 
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separately  study  each  7T.of,  j  =  1,  ...,  p.  We  let  S.  .  be 
indeterminates,  i  =  1,    ...,   m,  j  >  1,  and  denote  S.  :=  S..  .,  ...,  S  .. 

(16.1)  DEFINITION.  Let  r  >  0  tie  an  integer,  and  let  E  be  a 
polynomial  in  k[L  ,  L  ,  . ..,  L  ,  S  ,    . ..,  S _],  nontrivial  in  L  . 
A  pair  (y,  u),  y  in  Y,  u  in  U  satisfies  E  iff 

E(y(t),  y(t  -  1),  ...,  y(t  -  r),  u(t  -  l),  ...,  u(t  -  r)  =  0 

for  all  t  _in  Z.  The  response  map  f  satisfies  the  algebraic 
difference  equation  E  iff  every  input /output  pair  of  f  satisfies  E. 
The  order  of  the  equation  E  is  r.  The  equation  E  is 

(a)  rational  when  degT  E  =  1,  i.e. 

1  Lo 

E  =  E^,  ...,  Lp,  Sr,  ...,  S^  +  E2(Lr  ...,  Lr,  Sr,  ...,  S±); 

(b)  integral  when 

E  =\(Sr,    ...,  S1)L^  +E2(LQ,  L1,  ...,  Lr,  S^  ...,  S1), 

with  degT  E  <  s; 
Lo  2 

(c)  recursive  when  rational  and  integral,  i.e. 

E  =  E1(Sr,  ...,  S^  +  Egfly  ...,  Lr,  Sr,  ...,  S^; 

(d)  affine  when 

E  =  Jo  W  ■••>  si)Ld  +  EX'  •••>  Bi>- 

The  main  result  of  this  section  is  the  following 

(16.2)  THEOREM.  Let  f  be  a  polynomial  response  map, 
(a)  The  following  statements  are  equivalent; 

(i)  f  is  finitely  realizable; 
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(ii)  f  satisfies  an  algebraic  difference  equation; 
(iii)  f  satisfies  a  rational  difference  equation. 

(b)  The  following  statements  are  equivalent; 

(i)  f  is  bounded  and  finitely  realizable; 

(ii)  f  satisfies  an  affine  difference  equation. 

(c)  If  f  satisfies  an  integral  difference  equation  then  £ 
is  a  polynomial  system  and  f  satisfies  a  recursive  equation. 

The  proof  of  this  theorem  will  be  postponed  until  we  establish  some 
technical  facts  about  algebraic  difference  equations. 

The  study  of  such  equations  will  be  algebraized  through  the 

introduction  of  the  field  K  =  k({S.  .,  i  =  1,  ...,  m,  j  >  1})  obtained 

K    K 
by  adjoining  the  indeterminates  S.  .  to  k.  Let  L_,  A-  be  the 

K-subspace  and  K-subalgebra  of  ¥R  generated  by  \|rf  and  (recall  (5. 17)) 

all  the  tx.(S  +  ,  ...,  S,)  (note  order  of  arguments!),  for  all  t  >  0.  Let 

V  it  1 

S?:=&(Af). 

(16.3)   LEMMA.  Let  f  be  a  polynomial  response  map.  Then 

(a)  f  satisfies  an  algebraic  difference  equation  if  and  only 
if  trdegK  Q*  <  « . 

(b)  f  satisfies  a  rational  difference  equation  if  and  only  if 

jr 

£_  is  a  finitely  generated  field  extension  of  K. 

(c)  f  satisfies  a  recursive  difference  equation  if  and  only  if 
it 

A_  is  a  finitely  generated  K- algebra. 

(d)  If  f  satisfies  an  integral  equation  and  a  (possibly 
different)  rational  equation  then  f  satisfies  a  recursive  equation. 

(e)  f  satisfies  an  affine  difference  equation  if  and  only  if 


din^  Lf  <  » . 


PROOF.  We  begin  with  some  general  remarks.  Observe  that  an 
equation  E  of  order  r  is  equivalent  to 
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(16 A)   E(fUr-"ul,  f^...«2,  ...,  t*rf  t>   ^  ...,  ^  =  0. 

for  all  (u  ,  ...,  u.)  in  U*.  In  terms  of  Volterra  series,  (l6.k)   is 
equivalent  to 

(16.5)  E(*f(Sr,  ...,  Sx),  VSr-r  '••'  Sl}'  "•,  V8^1  V 

sr,  ...,  s1)  =  o. 

A  change  of  variables     S£  nSi+1,     J«l,    ...,  r,      Si^3!*     and 

|jHL  ,     if     3  >  1     transforms   (16.5)   into 
0        3-1 

(16.6)  EUf(sr+1,  ...,  sx),  tf(sr,  ...,  sx),  ...,  tf(s2,  sx),  ^(S^, 

sr+1,  ...,  s2)=o. 

Thus  an  equation  of  order  r  gives  rise  to  an  equation  of  order  r  +  1. 

We  now  prove  (a).  Let  f  satisfy  E,  with  E  of  order  r 
smallest  possible.  Let  c(L1,  . . .,  Ly,  Sr,  . . .,  S±)  4  0  be  the 
leading  coefficient  of  E  as  a  polynomial  in  LQ.  Suppose  that 

(*)     e(*f(flr-l'  '  "  Sl}'  •"'  V  S*"  '"'  Sl}  =  °* 

Dividing  by  S   if  necessary,  we  may  assume  that  c(L1,  ...,  L^,   0, 
St-1>    '"'   Sl)  ^  °*  ^^ 

c(+f(Sr-l'  •'  V'  — '  V  °>  Sr-1'  —'  Sl}  =  °; 

is  an  equation  for  f  of  order  <  r  -  1,  contradicting  minimality  of  r. 
Therefore  (*)  is  false.  So  (ik.k)   is  a  nontrivial  equation  for 
\|r  (S  ,  ...,  S1)  with  coefficients  in 


K 


:r  :«K(tf,  ...,  4rf(Sr_r  ...,  Sx)) 


Since  the  derivation  of  (16.5)  from  (16.1+)  is  just  a  relabeling  of 
variables,  the  same  argument  gives  a  dependency  for  ^f(3r+1>  •••>  s1> 
over  K(+f,  ...,  +f(Sr,  ...,  S^).  Since  algebraic  extensions  of 
algebraic  extensions  are  again  algebraic  (formula  (U.l))  we  conclude  that 
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^(S, 


,  S  )  is  also  algebraic  over  K  ,  and  by  induction  also 


(&     is  algebraic  over 


Thus 


trde 


h.  £f 


=  trde 


K  <  r  <  °°. 


H\^ 


Conversely,  assume  that  trdegv  £.  <  °° .  Since  the  t-.(s .  ,  . . . ,  S. ) 

tr  R.    ~  y  It  Al 

generate  Q^,     there  exists  an  r  >  1  such  that  0,  is  algebraic  over  K  . 
In  particular,  \|rf(S  ,  ...,  S  )  is  algebraic  over  K  ,  so  there  is  an 
equation  of  algebraic  dependence  Q(i|rf(S  ,  . ..,  S  ))  =0  with  coefficients 


in  K  .  Multiplying  by  a  common  denominator  if  necessary,  we  may  assume 

that  all  coefficients  of  Q  are  in  k[  [S .  . }  U  {*-,  . ..,  ^(S  ,, 

ij      i       r  r-l 

Dividing  if  necessary,  we  may  further  assume  that  no  S  .  divides  Q. 
Therefore  the  evaluation  S.  .  i->  0  for  all  j  >  r  gives  an  algebraic 


,  SjHl. 


equation  of  order  r. 

(b)  A  rational  equation 
'Vf(Sr,  ...,  S1)  belongs  to  Kr'\ 


E  is  equivalent  to  the  statement 
Since  CL  is  the  union  of  the  I 


(b)  is  clear. 

(c)  Existence  of  a  recursive  equation  of  order  r  is  equivalent 
to  i|f_(S  ,  ...,  S.)  belonging  to  the  algebra 

\  -«*f,  -..,  *f(Sr-l'  ...,  S1)], 

which  is  in  turn  equivalent  to  A-  being  finitely  generated. 

(d)  An  integral  equation  corresponds  to  ^/Ss   ,  . ..,  S..)  being 

integral  over  A  ,  so  by  induction  as  in  passing  from  (16.5)  to  (16.6), 

**  K  K      K 

we  conclude  that  A-  is  integral  over  A  .  Since,  by  (b),  CL.  =  Q(A~) 

is  finitely  generated  over  K,  we  conclude  from  (1.13a)  that  A„  is 

finitely  generated. 

(e)  Similar  to  (c);  just  observe  that  an  affine  equation  of 

order  r  is  equivalent  to  tyAs   ,  . . . ,  S.. )  belonging  to  the  span  of 

1  r       x 

im$    •••>  ^f(sr_i>  •••»  si)  as  a  8Pace  over  K* 


(16.7)   LEMMA.  Let  f  be  a  polynomial  response  map.  Then 
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(a)  trdegK  CL.  <  °°  if  and  only  if  trdeg  g_  <  «>. 

(b)  dinL.  l£  <  °°  if  and  only  if  dim  L^  <  «>. 

(c)  If  L  is  a  finitely  generated  K- algebra,  then  A_  is 
finitely  generated  (over  k). 

PROOF.  We  first  prove  that  the  finiteness  statements  on  L_, 

K        K  ^^ 

A.,  and  CL.  imply  .the  corresponding  conditions  on  .L-,  A^,  and  CL.. 

By  (6.3)  these  statements  are  equivalent  to  the  existence,  respectively, 

of  an  affine,  recursive  or  algebraic  equation  E  for  f .  Let 

c(L.,  . ..,  L  ,  S  ,  . ..,  S.. )  be  the  leading  coefficient  of  E  as  a 

polynomial  in  L  .  Let 

D  :=  {(ur,  ...,  ux)  in  if  |  c(tf(ur-1,  •••,  1^),  ...,  t^), 

••-,  fp,  ur,  ...,  ux)  =  0}. 

By  (6.U),  ^_(u  ,  ...,  u.)  is  in  the  space  L^,  [respectively  in  the 
algebra  A.,  respectively  algebraic  over  0^.1  for  all  (uy,  ...,  u^) 
not  in  D.  The  conclusion  is  then  clear  from  the  Main  Lemma,  Part  II 
(12.11)  and  (I2.12e). 

Now  we  prove  the  "if"  parts  in  (a)  and  (b).  By  the  Main  Lemma 

(10.7),  the  coefficients  of  *f(Sr,  ...,  S^,  considered  as  a  polynomial 

in  S,,  ....  S  ,  are  in  L-.  Thus  L_  is  in  the  linear  space  over  K 

1     '  r         ""i        ^1  v 

generated  by  L-,  which  has  a  finite  basis  by  hypothesis.  Since  &f  is 

generated  as  a  field  by  the  elements  of  L^,     the  conclusion  about  (^ 

is  also  immediate.  " 

We  may  now  complete  the 

PROOF  OF  (16.2).  (a)  Follows  from  (l6.3a,b),  (l6.7a)  and  the 
equivalence  of  (a)  and  (d)  in  (I3.2). 

(b)  Clear  from  (l6.3e)  and  (16.7b). 

(c)  An  integral  equation  is  in  particular  an  algebraic  equation, 
so  by  (a)  f  satisfies  a  rational  equation.  The  conclusion  is  then 
clear  from  (16.3d),  (16.3c),  and  (l6.7c).  n 
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(16.8)   REMARKS,  (a)  For  a  finitely  realizable  f  there  is  up  to 
scalar  multiplication  a  unique  algebraic  difference  equation  E  with 
the  two  properties:  (i)  E  is  of  minimal  order  r,  and  (ii)  E  is 
irreducible  as  a  polynomial  in  k[LQ,  1^,  . ..,  L^,  S^,  ...,  S^] . 
Uniqueness  follows  easily  from  the  discussion  in  HODGE  and  PEDOE 
[1968,  page  110]. 

(b)  It  is  easily  verified  that  two  polynomial  input /output 
maps  satisfying  the  same  rational  equation  necessarily  coincide. 
Moreover,  by  simple  division  of  polynomials,  f  (more  precisely,  \|rf) 
can  be  reconstructed  from  the  rational  equation  E. 

(c)  It  can  be  seen  by  a  counterexample  that  Z«  =  polynomial 
system  does  not  imply  that  f  satisfies  an  integral  difference  equation. 

17.  Jacoblan  Condition. 

We  now  show  how  to  check  the  condition  "f  is  finitely  realizable" 
by  examining  increasing  truncations  of  the  Volterra  series.  We  take  k 
to  be  a  field  of  zero  characteristic  (when  char  k  =  p  ^  0  one  may 
generalize  the  criterion  via  spaces  of  differentials). 

As  in  the  previous  section,  we  assume  p  =  1  without  loss  of 
generality. 

Denote  by  D  ,  l<r<m,  s>l,  the  operator  which  takes 
rs    —      ~-  - 

partial  derivatives  of  polynomials  with  respect  to  the  indeterminate 

I  .  Let  DP,  s  >  1,  be  the  row  vector  (D  P,  . . . ,  DP) . 


(17.1)   DEFINITION.  The  n-th  Jacobian  matrix  Jn(f)  g£  f  If* 

D  e  *  D^e  \|r        . . .  D  e  t, 

1  n  f  2  nyf  n  n  f 

Vn*f(Sl}  '  •'•  VnW 

VnVSn-l'    •••'   Sl}  '  ••'     VnVSn-l'    "'  S1}J 


/ 


j,(f)  .- 
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(17.2)   EXAMPLE.  Let  f  be  a  linear  response  map  with  "impulse 


response"  A^  ...,  An,  ...  ,   i.e.  \|ff  =  Z  a  ^ 


U  iJ 


and 


Ai  =  ^alj'  •••'  V)^'  J  -  lm     Then 

r   m  n   m 

n  f  r'       1'   j=l  i=l  ij  i,r-j   j=l  i=l  i,j+rbij' 


so 


VnW  —B^-A 


s+r 

Thus  J  (f )  is  the  n-th  principal  minor  of  the  block  Hankel  matrix  of 
f  (KALMAN,  FALB,  and  AKBIB  [I969,  Chapter  10]),  and  it  is  well-known 
that  realizability  of  f  is  equivalent  to  the  existence  of  an  integer  s 
such  that  rank  J  (f)  <  s  for  all  n.  The  Jacobian  of  f  gives  a  new 
way  of  interpreting  the  classical  Hankel  matrix  of  f .  D 

(17.3)   THEOREM,  f  is  finitely  realizable  if  and  only  if  there  exists 
an  s  >  0  such  that 

rank  J.(f )  <  s  for  all  t  >  0, 

i.e.,  if  and  only  if  every  (s  +  l) -minor  of  J+(f)  is  zero  for  all 
t  >  0. 

PROOF.   By  HODGE  and  PEDOE  [1968,  III. 7,  Theorem  III], 
rank  Jn(f)  =  trdegK  K[eQtf,  ...,  enVSn-l'   "'  Sl)]* 

(K  is  k({S..))  as  in  the  previous  section.) 

["only  if"]  By  (l0.12h),  there  is  an  s  such  that 
trdeg.  A^l  <  s  for  all  n  >  0.  By  the  Main  Lemma  (10.7)  each 

A.    "™i  —  — 

6  V*.(S     it  •••»  S, ),  j  =  0.  ....  n  -  1  is  a  K- linear  combination  of 
n  fx  n-1*    n 

elements  of  A_.  So  rank  J  (f)  <  s. 
*~x  n    — 

["if"]  If  rank  J  (f )  <  s  for  all  t  >  0,   in  particular 
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(17.1+)   trdegK  K[€t+f,  ...,  et+f(SB,  ...,  S^]  <  s, 

for  all  t  >  s.  Let  B  be  the  subalgebra  of  Y   generated  by 

ff,    ...,  tf(Sg,  ...,  S1).  By  (17.10,  trdegK  et(B)  <  s  for  all 

t  >  s.  It  follows  from  (5.22)  that  trdeg„  B  <  s.  So  there  is  an 

r  <  s  such  that  ty _(S  ,  . . . ,  S1 )  is  algebraically  dependent  over 

\|r_,  ...,  t,(S  n,  ...,  S..),  and  the  conclusion  follows  from  (l6.7a).   n 
r       r  r-x       x 

(17.5)   EXAMPLE.  As  an  application  of  (17. h),   we  prove  that  the 
polynomial  response  map  t,     with  m  =  p  =  1  and 

♦f  !-*u+&+*l3  +  4+*x5+  "•  ' 
is  not  finitely  realizable.  Since 

€nVSr>  -'  Sl}  -  Sr  +  Sr-1  +  -  +  Sl  +  ^  +  -  +  C' 

for  any  s  <  n,  D  e  fJs  ,    . . .,  S. )  =  (r  +  s)£,  "  ;  this  latter 
^    —  '   s  n  f  r       1  Is 

expression  is  also  D  €  _11lff(S  ,  ...,  S.)  if  s  <  n.  Therefore 


J„<"  - 


Vi<f> 


<* .  x>e2 


and  so 


det  Jn(f)  =  (2n  -  1)  det  J^Wl^      +  t(Sll'        ''   ^ 

where  deg,  t  <  2n  -  2.  Since  I,   is  algebraically  independent  over 

Xn  xn 

k(|11 ,  ...,  !..   .),  an  induction  on  n  shows  that  det  J  (f)  4-   °  for 

all  n.  □ 


18.  Some  Examples  and  Counterexamples. 

We  discuss  now  several  examples  related  to  canonical  realizations  and 
input /output  equations. 
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(18.1)  EXAMPLE.  We  wish  to  illustrate  the  calculation  of  a  nonpolynomial 
canonical  system,  via  (10.4).  Consider  the  system  S  :=  (k  ,  P,  h,  0), 
where  m  =  p  =  1  and  the  equations  of  E   are 

xx(t  +  1)  =  x^t)  +  u(t), 
xg(t  +  1)  =  x^tJXgCt)  +  xx(t)  +  Jtg(t), 
y(t)  =x2(t). 

The  2-step  reachability  map  of  E   is 

(18.2)  %:   U2  -k2:  (u2,  ^h/v^I 

which  is  obviously  onto.  So  E   is  reachable. 

We  want  to  calculate  a  canonical  realization  of  f  :=  f„  .  Since  E 

obs  °      °  ° 

is  reachable,  E   -  EQ  ;  we  apply  the  construction  in  (10.4)  to  obtain 

*o  c  +■ 

E_  .  The  observation  algebra  A(E  )  will  be  determined  as  .LI  A  '  (E  )  by 

to  ""*    o  t50  o 

induction  on    t,     using  the  Main  Lemma  (10. 7).     Write     A(x)  =  k[T]1,  r\  ],     so 
A(P):  t^  •-*  t^  +  T,     ti2  h*  ti^g  +  n±  +  n2, 

and 

h  =  tj2- 

To  simplify  calculations,  let  rj.  :■  r\4  +  1,  i  =  1,  2,  so  that 

A(P):  n1  mt^  +  T,  t)2  ^^2  and  h  -  T|g  -  1.  Then  A  '  =  k[i)2  -  1]  = 

=  k[r)_].  By  induction, 


because 


.0,  t      -r"      *«■  -   ~tn 

A  =   klTlg,    TJgTlp     ...,    TJgTl^, 


A(P)(^*)  -A(p)(n2).(A(p)(fl1))t 


=  \\(\  +  T) 
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V   ,tx-  *j+lmt-j 

so  the  new  generator  tjp%+   appears  as  the  coefficient  of  T  .  Thus 

(18.3)  hf     =  A(E0)  =  k[  ((t,2  +  DC^  +  1)*,  t  >  0)] 
o 

is  not  a  finitely  generated  algebra.  Therefore  Sf   is  not  a  polynomial 

o 
system. 

By  (3-17)  the  canonical  state  space  Xf  =  X(Af  )  =  Xtkl^T^, 

lie 

2 


t  >  0]  )     is  an  almost-variety  which  can  he  represented  by  the  principal 
open  set     D  :=  {tj2  4  -  1)     in    k2  =  X^t^,  t^])     (or,  equivalents, 
{^^0}     in    X(k[^r   -n2]))     plus  an  extra  point     {*);     see  (3-19).     The 
morphism 

T:  X^X°bS  =Xf  , 
o 

in  (10. 1+)  is  the  map  X(i),  where  i:  Af  -»k[nr  t»2l   is  the  inclusion 
map.  Therefore 

T(x)  =  x  if  x  is  in  D,   and 
T(x)  =  *  if  x2  =  -  1. 

Since  T  is  a  k-system  morphism,  the  transitions  in  Sf   are 

P  (x,  u)  =  T(P(x,  u))  if  x  is  in  D,   and 

*o 
P  (*,  u)  =  *  for  all  u  in  U. 

fo 

The  initial  state  in  Z        is  0  in  D  and  the  output  is  given  by 
o 

h(x)  =  x2  if  x  is  in  D,  and 

h(*)  =  -  1. 

Note  that  T  identifies  precisely  those  states  of  XE   which  are 
indistinguishable . 
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(18.U)   EXAMPIE.  Consider  Z   and  f   as  in  (l8.l).  We  now  prove 
that  there  exists  no  polynomial  realization  of  f   in  which  every 
pair  of  reachable  states  is  distinguishable.  In  other  words,  it  is 
impossible  to  find  any  polynomial  system  Z  realizing  f   and  a 
one-to-one  abstract  system  morphism 

T  :  Z   -»Z, 
ac   ac    ' 

where  Z    is  the  abstractly  canonical  realization  of  f  .  Our  claim 
ac  ^  o 

depends  on  two  facts,  (i)  Let  A  :=  A(gg)(A(Z))  C  A(U*) .  Then  A 
is  maximally  separating  (l.ll)  with  respect  to  A(U2).  Indeed,  write 


ACU2)  =  k[Tx,  Tg];  by  (18.2)  and  (18.3), 


t 


A  =  k[{(T2  +  1)(TX  +  T2  +  1)  ,  t>0)]. 

With  a  change  of  variables  T2  :=  Tg  +  1,  T£  :=  T±   +  Tg  +  1  in  A(tT), 
A  becomes  k[ (TgT*,  t  >  0}).  By  (1.12),  A  is  maximally  separating, 
(ii)  Now  our  claim  follows  from  the  following  more  general  fact: 

(18.5)   LEMMA.  Let  f  be  any  polynomial  response  map.  Assume  that 
(i)  the  n-step  reachability  map  gf    of  Zf  is  onto  and  (ii) 
A(g   )(Xf)  is  maximally  separating  in  A(Un).  Then  any  quasi- 
reachable  Z  realizing  f  whose  reachable  states  are  distinguishable 
is  isomorphic  to  Zf. 

PROOF.  Let  T:  Z  -» Z   be  the  unique  dominating  morphism  (10.4). 

We  must  prove  that  T  is  an  isomorphism.  The  restriction  of  T  to  the 

reachable  part  Z_  of  Z  induces  a  morphism  of  abstract  systems 

T  :  Z^  -^f*  Since  by  hypothesis  Z_  is  abstractly  canonical,  TR  is 

a  bisection  (7.8).  Therefore  2L  is  also  reachable  in  n  steps.  In 

particular,  Z  is  quasi-reachable  in  n  steps  and  A(gn)  is  one-to-one. 

Let  B  :=  A(t^)(Xs).  Since  T°gn  =  gf  ft,  T  is  an  isomorphism  iff 

B  =  A  :=  A(gJ„  )(xj.  By  definition  of  "maximally  separating"  (l.ll)  it 

xof,n   f  ~ 

will  be  enough  to  prove  that  B  separates  no  more  points  of  IT   than  A. 

So  take  v,  w  in  U   and  suppose  that  A  does  not  separate  v  and  w, 
i.e. 
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(18.6)  a(v)  =  a(w)  for  all  a  in  A. 

By  definition  of  A(g_  _),   (18.5)  is  equivalent  to 
c(gf  n(v))  =  c(gf  n(w))  for  all  c  in  Af, 

in  other  words,  gf  (v)  =  gf  n(w) .  Since  gf   =  T°gn  and  T  is 

one-to-one  on  reachable  states,  we  conclude  that  g  (v)  =  g  (w). 

'  &n     Dnv  ' 

So  d(g  (v))  =  d(g  (w))  for  all  d  in  A(X),  which  means  that 

b(v)  =  b(w)  for  all  b  in  B,  i.e.  neither  does  B  separate  v  and 

w. 

When  k  =  R  or  k  is  algebraically  closed  the  hypothesis  of 
Lemma  (18.6)  can  be  weakened  considerably,  replacing  (i)  by  just 
dim  IL  =  n.  The  (easy)  proof  of  this  stronger  lemma  uses  (h.6)   plus 
(18.6). 

(18.7)  EXAMPLE.  We  continue  to  investigate  E  ,  f  ,   and  determine 
the  (unique)  irreducible  equation  of  minimal  order  satisfied  by  f  • 

Let  T).,  T)2  be  coordinates  for  S   defined  as  before. 
Working  in  £(2,), 

a       Uo         *  »  UoUn         *  *      *  «■ 

h  +  1  *=  T)2,  h  ^  +  1  =  Tj^g  and  h  *  x  +  1  =  t^  +  \^2, 
for  all  (u-,  u  )  in  IT.  Since 

V2  =  <V«2)2^2  =  (hU2  +  l)2/(h  +  l}> 
we  conclude  that 

(h  +  l)(hU2Ul  +  1)  -  (h^  +  if  -    (hU2  +  l)(h  +  l)u2  =  0, 

for  all  (u2,  u  )  in  U2.  Since  h(g(w))  =  f(w)  for  all  w  in  U[zl, 
the  polynomial 
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E  :=  (L2  +  1)(LQ  +  1)  -  (Lx  +  l)2  -  (Lx  +  l)(L2  +  l)S2, 

gives  an  algebraic  (rational)  difference  equation  for  f  ,  i.e. 

[y(t  -  2)  +  l][y(t)  +  1]  -[y(t  -  l)  +  l]2  - 

-  [y(t  -  1)  +  l][y(t  -  2)  +  l]u(t  -  2)  =  0, 

for  all  input /output  pairs  (y,  u)  of  f  .  The  polynomial  E  is 

easily  seen  to  be  irreducible,  and  E  is  of  minimal  order  for  f  , 

u   -  -  ° 

because  h  =  T)p  -  1  and  h  =  t)-.t)_  -  1  are  algebraically 

independent . 

XT 

(18.8)   EXAMPLE.  Since  always  trdegR  (^  <  trdeg  CL.,   there  is 
always  an  algebraic  difference  equation  of  degree  dim  Z  .  There  may 
exist,  however,  equations  of  order  strictly  lower  than  dim  Z.   To 
illustrate  this,  consider  the  bilinear  input /output  map  f..  =  f  „  , 
where  m  =  2,  p  =  1,  and  S.  :=  (k    ,  P,  h,  0),   n  >  1  arbitrary, 
and  P,  h  are  given  by  the  equations 

xx(t  +  1)  =  u^t),  xn+1(t  +  1)  =  u2(t), 

x2(t  +  l)  »  x^t),  xn+2(t  +  1)  =  xn+1(t), 


Xn(t  +  1}  -  Xn-l(t)>  X2n^  +  X>  =  X2n-l(t)> 

X2n+l(t)  -  "h**)^)  +  X2n<t)ul(t>> 

It  is  easily  shown  that  Z   is  canonical,  so  dim  Qu     =  dim  Z  =  2n  +  1. 
However,   f   satisfies  the  (affine)  difference  equation 


y(t  +  1)  =  ux(t  -  n)ug(t)  +  u2(t  -  n)ux(t), 


of  order  n  +  1  <  2n  +  1.  Note  that  f,   is  also  a  counterexample  to 

K  3 

dim  L_  =  dim  L_.  Counterexamples  with  m  =  1  also  exist;  e.g.  X„  :=  k  , 

x'  :=  0  and  Z   given  by 
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x^t  +.1)  =  u(t), 
x2(t  +  1)  =  x5(t), 

x^t  +  1)  =  x?(t)x1(t)  +  x^t)  +  x2(t)u(t), 
y(t)  =  XjCt), 

which  satisfies  an  affine  equation  of  order  2: 

y(t)  =  y(t  -  l)u(t  -  2)  +  y(t  -  2)u(t  -  l)  +  u(t  -  2). 

(18.9)   EXAMPLE.  When  £_  is  a  polynomial  system,  it  is  natural  to 
represent  it  "by  a  system  of  simultaneous  polynomial  difference  equations. 
An  important  question  in  applications  concerns  the  minimal  possible 
number  r  of  equations  in  such  representations  of  a  given  £  .  By 
(3.Uff),  r  =  smallest  possible  cardinality  of  a  set  of  generators  for 
Z  .  In  general,  r  <  trdeg  A  ,  and  equality  holds  if  and  only  if 
A-  =  k[T..,  ...,  T  ],  which  is  always  the  case  for  linear  response  maps 
f.  A  result  of  KAIMAN  [19761  shows  that  Af  is  a  polynomial  ring  also 
for  bilinear  single-output  (p  =  l)  response  maps.  For  more  general 
response  maps,  even  bounded  ones,  X-  may  be  different  from  affine 
space . 


For  instance, 
ying 

let  m 

- 

P  = 

1  and 

let 

f3 

be 

the 

response  m 

y(t)  = 

u2(t  - 

2)u(t 

- 

1) 

+  u3(t  - 

•  2) 

f5  -  v 

where 

X   : 
*3 

= 

*2, 

x*  :  = 
V 

=  0 

and 

S 

has 

equations 

Then 


x^t  +  1)  =  u(t), 
x2(t  +  1)  «  x2(t)u(t)  +  xj(t), 
y(t)  =x2(t). 

2   3 
Since  Z   is  quasi-reachable,  A_  =  A(Z)  =  k[ tj,,  t^,  t)2]  .  Thus  to 

represent  the  canonical  realization  (polynomial  by  (l6.2c))  T.         one 

3 
needs  at  least  3  equations.  (Note  that  the  noncanonical  realization 

Z_  of  f   requires  only  2  equations'.)  A  representation  of  S.   is, 

3      3  x5 

for  instance, 


xx(t  +  1)  =  u2(t), 
X^t  +  1)  =  u3(t), 

<3< 


x  (t  +  1)  =  xx(t)u(t)  +  x2(t),   x*  =  0, 


y(t)  =  ^(t), 

where  Xf  =  {(x^  xg,  xj  in  k5  |  x3  =  xg),  a  surface  with  a 
singularity  at  x  =  0.  Since  dim  Xf  =  2,  £   is  minimal,  a  fact  which 
agrees  with  E   being  quasi-reachable  and  abstractly  observable,  so 
weakly  canonical  (13.7).  D 

(l8.10)  EXAMPLE.  Consider  a  homogeneous  polynomial  response  map  f; 
i.e.  all  monomials  in  fr    have  the  same  degree,  say  s.  Take,  for 
simplicity,  m  =  p  =  1.  One  way  of  applying  the  theory  of  multilinear 
response  maps  (15. 3)  for  obtaining  realizations  of  such  f  is  the 
following.  Let 


*li  ' 
s 


*f  *Z%...l   lli1*"1] 
1    s   1 

where  the  sum  runs  over  all  possible  sequences  (with  repetitions) 
i  ,  . . . ,  i   of  integers  2 

_L  S  " 

(with  m  =  s,  p  =  1)  by 


i  ,  ...,  ±m     of  integers  >  1.  Define  the  s-linear  response  map  f 


1    s   1     s 

It  is  easily  verified  that  f  is  finitely  realizable  iff  f   is 
finitely  realizable.  Given  any  realization  E  of  f  ,  a  realization 
£  of  f  is  obtained  by  applying  the  same  input  to  all  the  input  channels 
of  £.  Such  a  method  of  realizing  a  homogeneous  f  has  been  suggested  by 
several  authors  (see,  for  instance,  BUSH  [I965D. 

The  method  is  theoretically  unsatisfactory,  however,  given  the 
noncanonical  nature  of  E.  The  following  example  shows  that  this 
procedure  may  be  also  unsatisfactory  from  a  practical  (synthesis)  point 
of  view.  Specifically,  we  shall  give  a  homogeneous  response  map  f  of 
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(arbitrary)  degree  s  such  that  Z   has  X  =  k  but  such  that  the 
lowest  possible  dimension  for  a  realization  Z  obtained  by  the  above 
procedure  is  s.  That  is,  we  look  for  an  f  with  X_  =  k  and 
dim  ILq  =  s. 

Let  E    have  the  equations  (with  X  =  k) 

x(t  +  1)  =  x(t)  +  u(t),   xfi  :=  0, 
y(t)  =xs(t). 

o  s 

Then  f ,=  f„  ,  where  Z,   has  m  =  s,  p  =  1,  X  =  k   and  equations 


y(t)  =  Xl(t)...xs(t). 


x±(t   +  1)  =  xi(t)  +  tt±(t),  x*  :=   0,   1=1, 


Clearly,  2,   is  reachable.  Moreover,  S,   is  algebraically  observable, 
since,  writing  A(kS)  =  k[T]-,  ...,  r\   ],  the  coefficient  of  T  . .  .f. .  ..T 

X  S  _L     X     S 

(T.  omitted)  in 
v  l 

A(P)h  =  U-l  +  T1)...(tis  +  Tg), 

is  T) . .   Thus  2,   is  canonical  and  has  dimension  s.  O 

(l8.ll)  EXAMPLE.  Consider  strictly  recursive  equations 

(*)     y(t)  =  R(y(t  -  1),  ...,  y(t  -  r),  u(t  -  1),  ...,  u(t  -  r)), 

for  finitely  realizable  polynomial  responses  f,  where  R  is  a 
polynomial.  Such  equations  are  well-known  to  exist  for  linear  response 
maps.  By  the  Cay ley- Hamilton  Theorem,  the  internally-bilinear  response 
maps  (15 .k)   of  systems  of  the  form 

x(t  +  1)  =  Fx(t)u(t), 
y(t)  =  Hx(t), 

(m  =  1,  F,  H  =  linear)  are  easily  seen  to  also  satisfy  equations  (*). 
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In  general,  however,  a  finitely  realizable  polynomial  response 
map  satisfies  no  equation  such  as  (*).  To  construct  counterexamples 
it  is  enough  to  exhibit  systems  2  such  that  for  each  r  >  1  there 
exist  pairs  of  inputs  w,  w  for  which  w(t)  =  w(t)  for  t  >  0, 
£(w)(t)  =  f(w)(t)  for  0<t<r  but  f(w)(r)  4   f  (w)(r) .  Such  w,  w 
clearly  contradict  (*).  We  give  three  such  counterexamples. 

(a)  A  one-dimensional  system  2   with  m  =  p  =  1,  where 

x(t  +  1)  =  x(t)  +  u(t),  x**  :=  0, 
y(t)  =x2(t). 

Here  take  w(t)  =  w(t)  :=  0  if  t  ^  -  1,  r  -  1,  w(r  -  l)  =  w(r  -  l)  :=  1, 
w(-  1)  :=  -  1  and  w(-  l)  :=  1. 

(b)  Again  m  =  p  =  1,   and  XL-  given  by 

xx(t  +  1)  =  x±(t)   +  X;L(t)u(t),  xj  :=  1, 
x2(t  +  1)  =  x2(t)  +  x2(t)u(t),  x*  :=  1, 
y(t)  =  x^t)  +  x2(t). 

Here  take  w(t)  =  w(t)  :=  0  if  t  ^  -  1,  r  -  1,  w(r  -  l)  =  w(r  -  l)  :=  (  l_), 

*         1  ~ 

w(-  l)  :=  0  and  w(-  l)  :=  (  , ).  Note  that  f„   is  an  internally-bilinear 

-  1  L6 

response  map. 

(c)  Let  m  =  2,  p  =  1,  and  2   given  by 

X]_(t  +  1)  =  xx(t)  +  ux(t),  x*  :«  0, 
x2(t  +  l)  =  x2(t)  +  u2(t),  xl   :=  0, 
y(t)  =  Xl(t)x2(t). 

Here  take  w(t)  =  w(t)  :=  0  if  t  ^  -  1,  r  -  1,  w(r  -  l)  =  w(r  -  l)  = 

=  w(-  l)  :=  (,)  and  w(-  l)  :=  -  (,).  Note  that  tv       is  a  bilinear  response 

map.  "J 


CHAPTER  V.   STATE-AFFINE  SYSTEMS 

Some  special  types  of  system  configurations  arose  in  Theorem  (15.5) 
as  natural  realizations  for  bounded  input /output  maps.  The  most  useful 
of  these  configurations  are  state-affine  systems;  they  are  studied  in 
this  chapter. 

As  indicated  in  Chapter  I,  we  shall  base  our  development  on  the 
notion  of  a  representation  of  the  Volterra  (or,  equivalently,  the 
exponent)  series  of  f. 

19.  Recognizable  Series. 

Throughout  this  section,  cp  is  an  exponent  series  (5-6)  whose 
support  is  contained  in  A  ,  where 

J  =  {6o  =  (0),   51,  ...,  5g}. 

(19.1)   DEFINITION.  A  representation  (with  support  in  Aj)  is  an 
object  R  =  (X,  {Fa,  a     in  J},  {g^,  a  =  b±,    ...,  Sg},  h),  where 

(a)  X  is  a  vector  space  over  k, 

(b)  each  F  :  X  ->  X  is  a  linear  map, 

(c)  each  g   is  in  X,  and 

(d)  h:  X  -»Y  =  kr  is  a  linear  map. 

For  each  a  =  aL...afc  in  J*  let  F^  :=  Fa  ...F  ,   and  write  FA  :=  1^. 
If  y   »  0L...O.  (0)...(0)  is  in  J,  with  a.    ±   (0),  then 

if  7  =  (0)...(0),  g  :=  0.  With  these  notations,   R  is  accessible  iff 


R  is  reduced  iff 
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O  T*  ker  hF  =  {0}; 
a  in  J       a  ' 


E  is  canonical  iff  R  is  both  accessible  and  reduced.   The  dimension 


represented  by  R  is  given  by 

K   -— — — — —— 

cpR(a)  :=  hga  for  all  a     in  Aj. 

The  representation  R  is  minimal  iff  f  for  any  R  for  which  cp  =  cp~ 
necessarily  dim  R  <  dim  R.  When  dim  R  <  °°,     cp   is  recognizable.   The 
linear  map  T:  X  -h>  X  induces  a  morphism  of  representations 

T:  R  =  (X,  (Fa),  [g^],  h)  -»R  =  (X,  {Fa},  {g^,  h), 

iff  Tg  =  g   and  T°F  =  F  °T  for  all  a     and  h  =  h°T. 

The  terminology  "recognizable"  is  taken  from  automata  theory,  as 
explained  in  Chapter  I. 

It  is  easy  to  see  that  representations  form  a  category  with  the 
above  notion  of  morphism. 

(19.2)  DEFINITION.  The  behavior  matrix  B(cp)  of  cp  is  an  infinite 
block  matrix,  with  rows  indexed  by  J*  and  columns  indexed  by  A  , 
whose  (a,    g)-th  entry  is  cp(ap),   a  column  vector  in  Y  =  k  . 

We  denote  by  B_  the  p-th  column  of  B(cp). 
p  _ 

(19.3)  THEOREM.  Any  cp  has  a  canonical  representation  R  .  _If  R 
and  R  are  two  canonical  representations  of  cp,   there  exists  a  unique 
representation  isomorphism  T:  R  ->  R.  Further,   cp  is  recognizable  if 
and  only  if  rank  B_(cp)  <  °°;  in  this  case  a  representation  R  of  cp  is 
canonical  (i)  if  and  only  if  R  is  minimal  and  (ii)  if  and  only  if 

dim  R  =  rank  B_(cp). 


10J 


PROOF.  We  define  R  =  (X,  [F   ),    {ga),   h)  as  follows: 

X  :=  linear  space  spanned  by  {B  ,   p  in  Aj}, 

g5.  :=B5.'  i   =  1'  ""  S' 
1      1 

h  :=  linear  map  induced  "by  the  projections  in  the  A-th  block 

row:  B.  »-*cp(p),   and 
P 

F~   :  =  linear  map  induced  by  the  column  shifts  B  »-» B   , 
oi  P    °jP 

i  =  0,  . ..,  s. 

Note  that  the  F   are  well-defined,  since  any  relation  among  columns 

(19-u)     («&t.)  Ve  '  °>    re   in  k' 

implies  Z  r„Bc  Q  =  0,   i  =  0,  ...,  s.  Indeed,  the  a-th  (block)  row 

P  °iP 

of  Z  r  Bc  _  is  Z  rDcp(aS.p),  which  is  also  the  06. -th  row  of  (19. k), 

P  oj_p  pi  1 

and  hence  zero. 

Clearly, 

p         p1...pt_1  pt         p 

for  any  p  in  A,,   so  R  is  accessible.  By  definition  of  h, 
cpR  (P)  =  h(g_)  =  h(B  )  =  cp(p), 

so  R   represents  cp.  Now  take  x  =Z  r0BQ  in  Q  ker  hF  .  Then 

cp  p  p      Ct       u 

0  =  hF  x  =  Z  r  Itf  Q    n   gn  =  Z  r  cp(ap)  =  a-th  block  row  of  x, 
a      p  ap,  •  •  •p^._]_  p+     P 

for  all  a  in  J  .  Thus  x  =  0  and  R  is  reduced,  hence  canonical. 

The  proof  of  the  theorem  will  be  complete  after  we  establish 
the  following  lemma  (similar  to  (7-7)  and  (11. 3)): 

(19.5)   LEMMA.  Let  R,  R  be  representations  of  cp,  with  R  accessible 
and  R  reduced.  Then  there  exists  a  unique  representation  morphism 
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T:  R  ->R.  When  R  is  canonical,   T  is  onto. 

PROOF.  Define  T:  X  -»X  as  the  linear  extension  of 


T(ga)  :=ga,     ex     in  Aj. 


By  definition  of  representation  morphism,  this  is  clearly  the  only 
possible  choice  for  T.  Thus  the  lemma  will  follow  if  T  is  well- 
defined.  We  must  show  that  if  x  =  Z  rQgQ  =  0  (finite  sum)  then  also 

*      p  P 
x  :=  Z  r0g0  =  0.  Pick  a  in  J  .  Then 
P  P 

h»Fa(x)  =  Z  rh(L)  =  Z  rpcp(aP)  =  Z  rph(gap)  =  hFa(x)  =  0. 


Since  R  is  reduced,  this  means  that  x 


0. 


(19.6)   REMARK.  The  above  theorem  gives  rise  to  an  algorithm  for 
constructing  representations  of  a  recognizable  series  cp  from  I|(cp)  • 
It  is  only  necessary  for  this  purpose  to  find  a  submatrix  0  of  full 
rank  of  E;(q))  and  to  express  the  g  ,   F   and  h  with  respect  to  the 
basis  consisting  of  the  set  of  columns  used  in  the  definition  of  $. 
This  algorithm  is  a  minor  variation  of  that  given  by  FLIESS  [  1972]  and 
generalizes  the  one  given  for  the  linear  case  by  ROUCHALEAU  [1972]. 


(19.7)   EXAMPLE.  Let  m  =  p  =  1  and  take  a  linear  (15-2)  response 
map  f,  „*»**  Yf 


with  ^  =  Z  &^1±'     <Pf  =  (  E  ai01)l.  The  only  possible 


nonzero  columns  of  B(cp  )  are  those  indexed  by  1,  01,  ...  , 
only  possible  nonzero  rows  are  those  indexed  by  {0  ,   n  >  0): 


and  the 


1 

01 

021  .  . 

.   oni   ... 

A 

ai 

a2 

a3   .  . 

'   an+l   *  '  ' 

0 

a2 

*5 

\       •    ' 

•   an+2   *  '  • 

0^ 

a3 

\ 

a5   .  . 

a     ... 

n+3 

„n 

• 

0 

an+l 

; 

; 
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the  classical  Hankel  matrix  of  the  linear  response  map  f .  When 
m,  p  ^  1,  the  only  possible  nonzero  part  of  B(<p)  becomes,  with  a 
suitable  ordering  of  row  and  column  indexes,  the  block  Hankel  matrix 
of  f;  the  realization  procedure  in  (l6.6)  coincides  with  the  well- 
known  "Silverman's  formulas"  (SILVERMAN  [1971])  D 

(19.8)   EXAMPLE.  When  f  is  internally-bilinear  (15.4),  our  B(cp) 
becomes  the  generalized  Hankel  matrix  introduced  by  ISIDORI  [1973]   and 
FLIESS  [1973]-  When  f  is  bilinear  in  the  input /output  sense  (I5.3), 
the  nonzero  part  of  l(<p)  can  be  arranged  so  as  to  become  the  matrix 
introduced  by  KALMAN  [ 1976] .  □ 

20.  State-Affine  Systems. 

(20.1)   DEFINITION.  The  polynomial  system  2  is  state  affine  iff 

(a)  X  =  k   for  some  integer  n, 

(b)  h:  X  -» ]r  is  a  linear  map, 

(c)  x  =  0,  and 

(d)  for  each  fixed  input  value  u,  P(x,  u)  is  an  affine 
(  =  linear  +  translation)  function  of  x. 

In  other  words,  there  exist  polynomials  p. .(T  ,  ...,  T  )  and 
q_.(T1,  ...,  T  )  such  that  the  transition  equations  for  £  are  given 
by. 


Xi(t  +  1)  =  XQ  PiJ(u(t))Xj(t)  +  q±(u(t)),   1=1,  ...,  n. 

£  is  span- reachable  iff  the  set  of  reachable  states  X_  spans  k  ; 
span-canonical  iff  both  span- reachable  and  observable;  minimal  iff 
dim  L  <  dim  T,     for  any  state-affine  system  S  for  which  f„  =  f ~.  A 
system  morphism  T :  £  ->  Z    between  state-affine  systems  is  a  morphism 
of  state-affine  systemsi  iff  T :  X  -» X  is  linear . 

(20.2)   REMARK.  It  follows  from  Theorem  (15.5)  that  a  polynomial 
response  map  f  is  bounded  and  finitely  realizable  iff  f  can  be 
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realized  "by  a  system  which  is  state  affine  except  for  the  condition 

it  it 

x  =  0.  A  coordinate  translation  may  of  course  be  used  to  make  x  =  0, 

without  changing  the  affine  form  of  P.  However,  this  makes  h  an 
affine,  rather  than  a  linear,  map.  In  other  words,  the  equilibrium- 
level  output  h(x  )  may  become  nonzero.  We  shall  assume  for  the  rest 
of  this  chapter  that  a  coordinate  translation  has  been  performed  (if 
necessary)  on  Y  =  k   so  that 

cp  (A)  =  0  for  all  response  maps  f . 

With  this  convention,  h(x*)  =  0  for  every  system.  Thus  bounded  + 
finitely  realizable  =  realizable  by  a  state-affine  system.  C 

(20.3)   REMARK.  The  basic  observables  of  a  state-affine  system  are 
affine  functions  of  the  state;  thus 

(abstract)  observability  =  algebraic  observability, 
for  such  systems.  ^ 

Let  S  be  a  state  affine  system.  Since  P  is  affine  in  x  and 
polynomial  in  u,  there  exists  a  subset  J  =  {Sq  =  (0),  b^,    ...,  5g) 

of  N*1  and  matrices   {F&  ,  b±     in  J}  and  {g&  ,   1  =  1,...,  s} 

i  i 

such  that 

s      c     s      ~ 
(20.  k)       P(x,  u)  =  X  F&  xu  i  +  ^  g5  u  1. 

i  i 

We  shall  call  R  =  R^  :=  (kn,  {F  ,  i  -  0,  ...,  s},  {g^,  i  =  1,  ...,  s), 
h)  the  representation  associated  to  Z.  Take  w  =w.fc...w1  in  U  .  An 
easy  calculation  shows  that 

(20.5)   P(t)(x,  w)  =   Z  .  (F  x  +  g  )wa. 
a  in  J   u    u 

Thus  the  t-step  reachability  map  gt:  U  ->  k   becomes 
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(20.6)   g. (w)  =  P(t)(0,  w)  =   Z  .  g  wC 
z  a  in  Jz     a 


and  therefore 

fE(w)  =  h(gt(w))  =  Z  hCg^w01  =  E  cpR(a)wa, 

so  that  cp  =  cp  . 

I     K 

We  now  have  proved  the  following 

(20.7)  LEMMA.  The  assignment  £  -» R_  is  a  bisection  between  state 
affine  realizations  of  a  bounded  polynomial  response  map  f  and 
finite-dimensional  respresentations  (with  fixed  basis  for  X)  of  cp  . 

The  above  assignment  preserves  reachability  and  observability 
properties: 

(20.8)  LEMMA.  £  is  span- reachable  [respectively  observable]  if  and 
only  if  R„  is  accessible  [respectively  reduced]. 

PROOF.  Let  b:  k  -> k  be  a  linear  function,  and  take  w  in 
U  .  Then  (19.6)  implies  that 


(20.9)   b(gt(w))  =   Z  t  b(ga)w 


a  in  J1 

Since  k  is  an  infinite  field,  the  right  side  of  (20. 9)  is  zero  for  all 
w  in  U   iff  bfa;  )  =  0  for  all  a     in  J  .  Thus  there  exists  a 
b  1   0  with  b(g  (w))  =  0  for  all  w  in  U[z]   (i.e.,  Z     is  not 
span-reachable)  iff  there  is  a  b/0  with  Mg^)  =  °  for  a11  a 
(i.e.,  Ry  is  not  accessible). 

t 
We  now  prove  the  observability  part.  Take  w  in  U  .  Then 

(20.10)  hW(x)  =  hoP(t)(x,  w)  =   Z  t  h(F  x  +  gjw01. 

a  in  J     u     u 
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Two  states  x  and  x  are  indistinguishable  iff  h  (x)  =  h  (x)  for  all 
w  in  U  ,  which  by  (20.10)  means  that 

(20.11)  Z   hF  (x  -  x)wa  =  0  for  all  w  in  U*  and  all  t  >  0. 
a  in  J*   a 

Using  again  the  fact  that  k  is  infinite,  (20. ll)  is  equivalent  to 

-.  *     r 

x  -  x  being  in  ker  hF  .   for  all  a     in  J  .  So  h   is  one-to-one 

iff  H  ker  hF  =  {0},   as  required, 
a 

(20.12)  REMARK.   The  hypothesis  k  =  infinite  is  essential  for  the 
above  result  in  its  present  form.  The  correct  generalization  to 
arbitrary  fields  is  that  only  monomials  u  i  which  are  linearly 
independent  as  functions  should  be  used  in  the  definition  of  a  state 
affine  system. 

From  (20.7),  (20.9),  and  (19.3)  we  obtain  the  main  result  of  this 
section: 

(20.13)  THEOREM.  Any  bounded  and  finitely  realizable  response  map 
f  has  a  span- canonical  state-affine  realization,  unique  up  to 
isomorphism  (  =  change  of  basis  in  the  state-space).  A  realization  Z 
of  f  is  span- canonical  (i)  if  and  only  if  S  is  minimal  and  (ii)  if 
and  only  if  dim  2  =  rank  B_(cp  ). 

Of  course,  realizations  can  be  obtained  from  B(cp  )  using  the 
algorithm  in  (19.I4).  There  is  an  interesting  interpretation  of  the 
row- space  R„  of  B_(cp  ),  which  implies  a  natural  duality  between  the 
observation  space  L_  and  the  state  space  of  the  span-canonical  state- 
affine  realization  of  f  (i.e.,  the  column  space  of  B(q>J).  Indeed, 
define 

on  generators  by 

(20.11+)  T](f*)  :=   Z  t  q>.(ap)wa  for  w  in  u\   j  =  1,  ...,  p,   t  >  0. 

0    a  in  J   J 


109 


It  is  not  difficult  to  verify  the  following 

(20.15)  PROPOSITION,  ij:  L_  -  Rf .  In  particular,   dim  L^  =  rank  B(cpf)  = 
-   dimension  of  span-canonical  state-affine  realization  of  f .  C 

21.  Finite  Response  Maps  and  Cascades  of  Linear  Systems. 

(21.1)  DEFINITION.  The  total  degree  of  the  polynomial  response  map  f 
(or  of  its  Volterra  series  ^„)  is 

tdeg  f  =  tdeg  tf  :=  sup  {||a||  |  \|/f(a)  ^  0}  <  <»; 

f  is  finite  iff  tdeg  f  <  °° . 

Since  obviously  deg  tf  <  tdeg  ^f,  a  finite  response  map  is  in 
particular  bounded. 

(21.2)  EXAMPLES.  The  nonzero  polynomial  response  map  f  is  linear 
if  and  only  if  tdeg  f  =  1;  when  f  is  bilinear,  tdeg  f  =  2.  On  the 
other  hand,  an  internally-bilinear  response  map  f  is  bounded  but  not 
necessarily  finite.  c 

A  natural  class  of  realizations  for  finite  maps  will  consist  of 
feedback-free  interconnections  of  linear  systems,  defined  below: 

(21.3)  DEFINITION.  The  polynomial  system  £  is  a  cascade  of  linear 
systems  iff  X  =  k",  x  =  0  and  there  exists  a  direct  sum  decomposition. 

X  =  Xn  ©  ...  ©X,,   d  >  1, 
1         d7     — 

and  linear  maps  A.:  X.  ->X.,   i  =  1,  ...,  d  such  that  the  transition 
equations  of  £  become 

x.(t  +  1)  =  A.x.(t)  +  B.(x1(t),  ...,  xi_1(t),  u(t)), 
x  (t)  in  X.  for  all  t,  i  =  1,  ...,  d. 

Given  a  decomposition  X  =  XL  ©  . . .  ©  Xd  as  above,  let 
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(21.1+)   Z.  :=  X.  ©  ...  ®XJ}   j  =  1,  ...,  d. 
3  3  d 


Conversely,  for  any  chain  Z ,  C  Z  _  C  . . .  C  2L  =  X,   there  exists  a 
decomposition  {X. 
is  easy  to  prove. 


decomposition  {X.}  such  that  (21.1+)  holds.  Thus  the  following  result 


(21.5)  LEMMA.  The  state-affine  system  2  is  a  cascade  of  linear 
systems  if  and  only  if  there  exists  a  chain  of  subspaces 

f03=zd+1czdc...cz1=x, 

such  that,  with  the  notations  of  (20. h) , 

FZ.CZ.,   j  a  1,  ...,  d,   and 
o  J    3 

F  Z.  C  Z   ,   i  =  1,  ...,  s,   j  =  1,  ...,  d. 
§±   J  -  J+l 

For  any  a  =  a.... a   in  supp  if,      let 

8 (a)  :=  number  of  nonzero  vectors  among  a.,  ...,  a.. 

Clearly, 

»(a)  <  |M|  <  d. 
The  main  result  of  this  section  is  the  following 

(21.6)  THEOREM.  The  following  statements  are  equivalent  for  any 
polynomial  response  map  f : 

(a)  f  is  bounded  and  the  span  canonical  state-affine 
realization  of  f  is  a  cascade  of  linear  systems. 

(b)  f  is  realizable  by  a  cascade  of  linear  systems. 

(c)  f  is  a  finitely  realizable  finite  map. 

PROOF,   (a)  =*>  (b)  Trivial. 


Ill 


(b)  =>  (c)  Let  f  =  f  ,  with  E  as  in  (21. 3).  Let  each  G. 

^  1 

have  total  degree  s.   and  let  h  have  total  degree  s.   It  follows  by- 
induction  on  i  that 

tdeg^<ssr..sn. 

(c)  =*>  (a)  Let  tdeg  \|f-,  =  d.  Referring  to  the  realization 
constructed  via  (19-3)?  let 

Z.    :=   span  {Bp,   p  in  Aj,   j  <  «(3)J,   J  =  1,  ...,  d  +  1. 
The  result  is  then  clear  by  (21. 5). 


22.  Rationality. 

We  now  indicate  how  the  automata- theoretic  notion  of  rationality- 
generalizes  to  the  present  context.  The  convention  (20.2)  that  cp(A)  =  0 
for  every  exponent  series  is  still  in  force. 

(22.1)   DEFINITION.  A  scalar  (p  =  l)  exponent  series  cp  is  rational 
iff  cp  can  be  expressed  in  terms  of  finitely  many  vectors  in  g   by 
means  of  a  finite  number  of  any  of  the  following  three  types  of 
operations: 

(i)      (cp,   cp)  •->  rep  +  scp,      for  any     r,    s     in     k, 

(ii)      (cp,   cp)  Mcpcp, 

(iii)     cp  »->cp      :=    Z     cp  , 
i>0 

* 
where  cp   is  interpreted  as 

* 


(a)  :=   E   cp  (a),   for  all  a     in  A 


is  a 


A  vector  exponent  series  (cp  ,  ...,  cp  )»   is  rational  iff  each  cp   is 


P 
rational. 
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(22.2)   THEOREM  (KLEENE-SCHUTZENBERGER).  The  exponent  series  cp  is 
rational  if  and  only  if  cp  is  recognizable. 

PROOF.  See  EILENBERG  [19lh,   Theorem  VII. 5. 1] . 

The  notion  of  rationality  can  be  interpreted  in  terms  of  a 
calculus  of  interconnections  of  state-af f ine  systems .  This  calculus 
permits,  via  exponent  series,  the  determination  of  the  Volterra  series 
of  a  given  system  and,  conversely,  the  construction  of  a  realization 
given  (a  rational  expression  for)  a  Volterra  series.  The  calculus  is 
obtained  as  a  straightforward  translation  of  the  well-known 
manipulations  with  automata,  as  given  for  instance  in  EILENBERG  [ 197^1 • 
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